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GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: October 27, 2003, 10:47:27 ; Search time 5308.15 Seconds 

(without alignments) 
10489.161 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



US-10-017-372E-38 
1361 

1 tggtaccgagatggcgcctt cgattaaagcggccgcgact 1361 

I DENTI TY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched : 



2888711 seqs, 20454813386 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



5777422 



Database 
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ALIGNMENTS 



RESULT 1 

PIGTGFB1A 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



gene 
CDS 



MAM 31-MAR-1995 
complete cds . 



PIGTGFB1A 32 06 bp mRNA linear 

Sus scrofa transforming growth factor beta-1 mRNA, 
M23703 

M23703 . 1 GI :755044 
transforming growth f actor-beta-1 . 
Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Mammalia; Eutheria; Cetart iodactyla ; Suina; Suidae; 
1 (bases 1 to 3206) 

Kondaiah,P., Van Obberghen-Schill ing , E . , Ludwig,R.L. 
Sporn,M.B. and Roberts, A. B. 

cDNA cloning of porcine transforming growth factor-beta 1 mRNAs 
Evidence for alternate splicing and polyadenylat ion 
J. Biol. Chem. 263 (34), 18313-18317 (1988) 
89054010 
2461367 
On Apr 1 
Original 
mRNA. 



Euteleostomi ; 
Sus . 

, Dhar, R. , 



1995 this sequence version replaced gi: 341017. 
source text: Sus scrofa (strain miniature swine) 



cDNA to 



FEATURES 

source 



Location/ Qual i f lers 
1. .3206 

/organism="Sus scrofa" 
/mol_type="mRNA" 
/strain="miniature swine" 
/db_xref ="taxon: 98 23" 

/cell__type= "peripheral blood lymphocyte" 
1. .3206 

/gene="TGF-beta-l" 
906. .2078 
/gene=" TGF-beta-1" 
/codon_start=l 

/product="transforming growth factor-beta- 1 " 
/protein_id="AAA64616 . 1" 
/db_xref="GI : 755045" 

/ trans 1 at ion= "MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 



polyA_signal 
polyA_site 



BASE COUNT 
ORIGIN 



645 a 



KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR 

LSAHCSCDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH 

RRALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDT 

QYSKVIALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS " 

3186. .3191 

/gene= n TGF-beta-l" 

3206 

/gene="TGF-beta-l" 

1041 c 924 g 596 t 



Query Match 90 . 0%; 

Best Local Similarity 95.4%; 
Matches 1292; Conservative 



Score 1224.4; DB 4; Length 3206; 
Pred. No. 2.5e-212; 
0; Mismatches 11; Indels 51; Gaps 



1; 
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Qy 
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Qy 


426 


Db 
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Qy 


486 


Db 


1381 


Qy 


546 


Db 


1441 


Qy 


606 



C CGAGATGG CGC CTT CGGGG CTG CGG CT CTTG CCG CTG CTG CTG CCG CTG CTGTGGCTG C 

II III IIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIIII 

CCCCCATGCCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 



65 



960 



125 



TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 

1 1 1 II I M 1 1 1 1 1 1 M II I M I II I Ml M I 1 1 1 M I II 1 1 1 M I II I II 1 1 1 1 1 Ml 

TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 102 0 
AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 

1 1 1 1 in ii 1 1 1 1 1 ii 1 1 1 1 ii i in 1 1 1 .i i i i ii r i ii 1 1 1 1 ii ii 1 1 1 1 .M 1 1 1 1 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 108 0 
TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I I III II 1 1 1 M IM 1 1 1 II 1 1 1 1 M I 1 1 1 M I Ml II 1 1 1 Ml II II I IIIIIM 

TCGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGG 114 0 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

Ml M 1 1 1 II 1 1 1 1 1 1 1 1 II M III III 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 I 

CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 12 00 
CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

M 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 II 1 1 M I II 1 1 1 1 M II 1 1 1 II 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 I 

CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 12 6 0 
AAATCTATGATAAATT(^GGGCACCCCCC^CAGCTTATATATGCTGTTC^(^CGTCGG 4 25 

1 1 1 1 II I II 1 1 1 1 i I M M I M 1 1 1 1 M I M I M I Ml II I M Ml MM 1 1 1 1 M I II 

AAATCTATGATAAATT(^GGGCACCCCCCA(^GCTTATATATGCTGTT(^CACGTCGG 1320 
AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 5 

M 1 1 1 1 Ml 1 1 1 1 1 1 1 1 1 1 M II M I II 1 1 1 M 1 1 II 1 1 II 1 1 1 1 1 1 II 1 1 1 1 II 1 1 M I 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 13 8 0 
GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATAC^GCAATGATTCCT 54 5 

I I M M 1 1 ! 1 1 1 i 1 1 ' 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 : M 1 1 M i II I I! : 1 1 1 1 M I 

GGCTC^GTTAAAAGTGGAGCAGC^CGTGGAGCTATACCAGAAATACAGCAATGATTCCT 14 4 0 
GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 6 05 

IMIMI IIIIIM IIIMIMI IMMIMIIMIM IMIIIMIIIIMI I 

GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 15 0 0 



ATGTCACCGGAGTTGTG CGG CAGTGG CTG A CCCGCAGAGAGGCTATAGAGGGTTTTCGCC 

II I II I II 1 1 1 1 II 1 1 1 II M I III I M 1 1 II I III M M I Ml II 1 1 1 1 1 1 M I I 



665 



Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 



1501 ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 1560 



666 



725 



TCAGTGCGCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 

IIIIIMI MMMMMMMIIMM MMMMMIMM IMMIIIIIIII 

1561 TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 162 0 



726 



785 



TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 

I i 1 1 1 1 I ! 1 1 1 1 1 1 1 1 1 1 1 1 ; 1 1 1 1 ! I 1 1 1 i 1 1 ! 1 1 1 1 1 1 1 ! ! I 1 1 1 1 i I ! 1 1 1 ! i 

1621 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 168 0 
786 TGCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

IIIIIIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

1681 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGA- 173 9 
84 6 CCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATA 905 



1740 



906 



-GCCCTGGATA 174 9 



965 



CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 

i m 1 1 1 1 1 i 1 1 1 1 1 1 I ii i ii ; 1 1 1 1 1 1 : i ii 1 1 1 :i: 1 1 1 1 1 1 1 1 m 1 1 1 1 1 1 Mi 

17 50 CCAACTACTGCTTCAGCTCCAGGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 18 09 

966 TCCGGAAGGAGCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCT 102 5 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
1810 TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACGCAAGGGCTACCATGCCAATTTCT 18 6 9 

1026 GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTG 1085 

IIIIIMMIIIIIIIIMIIMIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIMIII 

1870 GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 192 9 

1086 TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 114 5 

MMMMMMMMIIMMMIMMIIMMMMMMMMMMMMMMI 
193 0 TGTACAACCAGCACAACCGGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 198 9 

114 6 AGCC^CTGCCCATCGTGTACTACGTGGGGCGC^GCCCAAGGTGGAGCAGGTGTCCAACA 1205 

II 1 1 II M II 1 1 1 MM M I M II II III M II 1 1 1 1 III III II I M II 1 1 1 II M 1 1 

1990 AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 2 04 9 
12 06 TGATGGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGGCCCGCCCACCC 1265 

1 1 1 1 1 1 M 1 1 1 1 1 II I II Ml I II 1 1 1 1 1 1 M 1 1 1 1 III MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

2 050 TGATGGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCC 2109 
1266 GGCAGGCCCGGCCCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCC 1325 

II 1 1 1 M I 1 1 1 1 MM II I II II M 1 1 1 M II 1 1 II III MM II M II 1 1 1 II M 1 1 

2110 GGCAGGCCCGGCCCCACCGCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCC 2169 
1326 CAAGGCCACTTGGGATGGATTAAAGCGGCCGCGA 135 9 

MMMMMMMMMMMMI I I I I 

2170 CAAGCCCACTTGGGATCGATTAAAGGTGGAGAGA 2 2 03 



RESULT 2 
AX338213 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 



AX338213 
Sequence 
AX338213 
AX338213 . 



1326 bp 
1 from Patent WO0181404, 

1 GI: 18128750 



DNA 



linear PAT 09 -JAN- 2 002 



KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



CDS 



BASE COUNT 
ORIGIN 



Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Cetart iodactyla ; Suina; Suidae; Sus. 

1 

Strober,W., Nakamura , K . , Kitani,A. and Fuss, I. J. 

Inducible plasmid vector encoding tgf-_g(b) and uses thereof 

Patent: WO 0181404-A 1 01-NOV-2001; 

THE SECRETARY OF THE DEPARTMENT OF HEALTH AND HUMAN SERVICES (US) 
Loca t ion/ Qua 1 i f 1 er s 
1. .1326 

/organ i sm= " Sus scro fa " 
/mol_type= "genomic DNA" 
/db_xref="taxon: 9823" 
16. .1188 

/not e=" unnamed protein product" 
/codon_start=l 
/protein_id="CAD20538 . 1" 
/db_xref="GI : 18128751" 

/ 1 rans 1 a t i on = " MAPSGLRLLPLLLPLLWLLVLTPGR PAAGLSTCKTI DMELVKRK 
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR 
LSAHSSSDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH 
RRALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 
263 a 438 c 392 g 233 t 



Query Match 88.8%; 
Best Local Similarity 95.4%; 
Matches 1297; Conservative 



Score 1208.6; DB 6; 
Pred. No. 2e-209; 
0; Mismatches 9; 



Length 1326; 
Indels 53; Gaps 



3; 
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61 
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66 


Qy 


121 


Db 


126 
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181 


Db 


186 


Qy 


241 


Db 


246 


Qy 


301 


Db 


306 



TGGTACCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTG 

1 1 1 : i ii in 1 1 1 1 ii 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 iii 1 1 1 in 1 1 M n i m ii 

TGGTACCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTG 



60 



65 



120 



GCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGA 

1 1 1 1 1 1 1 III II M M 1 1 1 1 ' I II 1 1 ' I II 1 1 ! 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 MM 1 1 II I M 

GCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGA 125 
CATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCT 18 0 

II 1 1 1 1 MM I III III 1 1 1 1 M 1 1 1 MM 1 1 1 M 1 1 II III 1 1 1 1 1 M I I M 1 1 1 1 II 

CATGGAG CTGGTGAAG CGGAAG CG CAT CGAGG CCATT CG CGG C CAGATT CTG T CCAAGCT 18 5 
TCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGT 24 0 

mum 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 r m 

TCGGCTCGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGT 24 5 
ACTGGCTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCC 300 

I II M 1 1 IMM M I M I II M II I Ml I II I Ml II I III 1 1 1 1 1 1 1 1 M II I III I 

ACTGGCTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCC 3 05 
CGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGG 360 

M 1 1 1 1 1 III I II I II 1 1 II 1 1 1 Ml II II 1 1 II I M I Ml 1 1 II 1 1 1 1 II I II II Ml 

CGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGG 365 



Qy 


361 


Db 


366 


Qy 


421 


Db 


426 


Qy 


481 


Db 


486 


Qy 


541 


Db 


546 


Qy 


601 


Db 


606 


Qy 


661 


Db 


666 


Qy 


721 


Db 


726 


Qy 


781 


Db 


786 


Qy 


841 


Db 


846 


Qy 


901 


Db 


855 


Qy 


961 


Db 


915 


Qy 


1021 


Db 


975 


Qy 


1081 


Db 


1035 


Qy 


1141 


Db 


1095 



CAACCA^TCTATGATAAATTC^GGGCACCCCC(^CAGCTTATATATGCTGTTC^CAC 42 0 

III II M MM 1 1 1 M 1 1 MM MM III I III II I II I II III M I II 1 1 1 M II Ml 

CAACGAAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCT^ 42 5 

GTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCT 4 8 0 

I II I l : li Ml i II ill 1 1 iiiiiiii ii I M ill iiiiiiiiiiii :m hi ii I i 

GTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCT 4 8 5 
G CTGAGG CTCAAGTTAAAAGTGGAG CAG CACGTGGAG CTATACCAGAAATA GAG CAATGA 54 0 

III I MM MM M MM MMIMIMI I II III MIMMMIM III III M Ml 

GCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGA 54 5 
TTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTC 600 

I MM IMMIM IMIM I III Mill Mill III IIIIIIIIIIII III Ml II I I 

TTGCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCAGCGGAGTGGCTGTC 6 05 
CTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTT 660 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMI Ml 

CTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTT 665 
TCGCCTCAGTGCGCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAA 72 0 

MMMMMMMMI MMI M M M M M M I M M M M M M M I M M I M I 

TCGCGTCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAA 72 5 
CGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGGCCTT 78 0 

I II IMMIM M Mill I III Mill II III Ml MIMMMIM III III M III 

CGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTT 78 5 
CCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCGGGGCACCG 84 0 

MMMMMMMMMMMMMMM MMMMMMMMMMM MMMM 

CCTGGTGCTCATGGCCAGCCGGCTGGAGAGGGCCCAGCAGCTGCACAGCTCCCGGCACCG 84 5 
CCGAGCCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGGCCT 900 

MM MMI 

CCGA GCCCT 854 

GGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACAT 96 0 

III Mill MM Mill I III Mill II I II III IMIII Ml MM III III II Ml 

GGATACCAACTACTGCTTCAGCTCCA.CGGAGAAGAACTGCTGCGTGCGGCAGCTCTACAT 914 
TGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTC^TGAACCCAAGGGCTACCATGCCAA 102 0 

1 1 1 1 1 It 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E I M 1 1 11 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 

TGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAA 974 
TTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCT 108 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCT 1034 
GGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGC 114 0 

1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGC 1094 
GCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTC 12 00 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTC 1154 



Qy 12 01 CAACATGATCGTGCGTTCCTC 12 6 0 

! 1 1 I M II ! I M I Ml 1 1 1 1! Ill 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1! 1 1 1 

Db 1155 C^ACATGATCGTGCGTTCCTGCAAGTGCAGCTGA-GCCCCGCCCCGCCCACAGCCCCGCC 1213 

Qy 1261 CACCCGGCAGGCCCGGCCCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCG 132 0 

Mill I'l III IIIMIMM I II III III III llllllllll INI I hll, I Mill 

Db 1214 CACCCGGCAGGCCCGGCCCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCG 1273 

Qy 1321 TG C C C CAAG CC CA CTTGGG ATCG ATTAAAG CGG C CG CG A 13 5 9 

1 1 1 1 1 1 1 1 1 1 M m 1 1 1 1 1 ; i j 1 1 1 1 1 . i i 1 1 

Db 1274 TGCCCCAAGCCCAC - TGGGATCGATTAAAGGTGGAGAGA 1311 



RESULT 3 

GGTGFB1 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



FEATURES 

source 



gene 
5 'UTR 
CDS 



Euteleostomi ; 
Sus . 



GGTGFB1 1750 bp mRNA linear MAM 27-MAR-1996 

Porcine mRNA for transforming growth factor-beta 1. 

X12373 

X12373.1 GI:63808 

transforming growth factor-beta 1. 

Sus scrofa (pig) 

Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Mammalia; Eutheria; Cetart iodactyla ; Suina; Suidae; 

1 (bases 1 to 1750) 

Jakowlew,S.B. , Dillard , P . J . , Sporn,M.B. and Roberts, A. B. 
Nucleotide sequence of chicken transforming growth factor-beta 1 
(TGF-beta 1) 

Nucleic Acids Res. 16 (17), 8730 (1988) 
88335639 
3166520 

2 (bases 1 to 1750) 
Jakowlew, S . B . 
Direct Submission 

Submitted ( 14 - JUL- 1 988 ) Jakowlew S.B., National Institute of 
health, National Cancer Institute, Laboratory of Chemopr event ion, 
Building 41, Room B902, Bethesda , Maryland 20892, USA 
The submitters believe that the chicken cDNA library was 
contaminated with porcine cDNA, and that the sequence is infact 
porcine TGF-beta-1. 27-MAR-1996. 

Location/Qualifiers 

1. .1750 

/organ ism=" Sus scrofa" 
/mol_type= n mRNA M 
/strain= "white leghorn" 
/db_xref ="taxon: 9823" 
/ c 1 on e = " pTG FB - ChX 119" 
/cell type=" chondrocyte" 
1. .1750 

/gene="TGF-beta 1" 
1. .446 

/gene=" TGF-beta 1" 
447. .1622 
/gene="TGF-beta 1" 
/codon_start=l 

/product = "transforming growth factor" 
/protem_id="CAA30933 .1" 



/db_xref = "GI : 63809" 
/db_xref = " SWISS -PROT : P07200 " 

/ 1 rans 1 a t i on = " MPPSGPGLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRK 
RIE^IRGQILSKLRLASPPSQGDVPPGPLPEAVIALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNDSWGYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR 
LSAHCSCDSKDNTLHVEINAGFNSGRRGDLiATIHGMNRPFLLL^4ATPLERAQHLHSSR 
HRRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLD 
TQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS " 
matjpeptide 1285. .1619 

/gene="TGF-beta 1" 
3»UTR 1620. .1750 

/gene="TGF-beta 1" 

BASE COUNT 325 a 627 c 479 g 319 t 

ORIGIN 

Query Match 88.5%; Score 1205; DB 4; Length 1750; 

Best Local Similarity 94.9%; Pred . No. 8.8e-209; 

Matches 1288; Conservative 0; Mismatches 15; Indels 54; Gaps 2; 

CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 

I! 1 1 1 IMIMIIIMI I II I M I II II I M M M 1 11 1 II I II M I M II II 

CCCCCATGCCGCCTTCGGGGCCTGGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 501 
TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

ill I Mil Nil II IM.IIII II Mlh IMIMII Mill il MIIMI III Ml III 

TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 561 
AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 

M M MM M Ml 1 1 1 Ml I II MM M Ml II II I M M M M M M IN IM III Ml 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 621 
TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I M M M II I M 1 1 II II I MM I II M M II 1 1 MM MM M I M M M M IIIMM 

TCGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGG 681 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

1 : 1 1 1 1 M 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 i 1 1 1 1 1 ! 1 1 1 1 1 

CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 741 
CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

1 1 1 1 1 1 1 : 1 1 1 1 II 1 1 II M ; 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 ; M 1 1 ; 1 1 1 1 1 1 

CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 8 01 
AAATCTATGATAAATTC^GGGC^CCCCCCAC^GCTTATATATGCTGTTaVACACGTCGG 42 5 

1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 : i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! I ' 1 1 1 ,, 1 1 i 1 1 1 1 1 1 

AAATCTATGATAAATTCAAGGGC^CCCCCC^C^GCTTATATATGCTGTTC^CACGTCGG 8 61 
AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 5 

M 1 1 1 Ml M II I II 1 1 M I II 1 1 1 1 M I III I M I II II I M M II Mill I MM M 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 921 
GGCTC^GTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATAC^GO^TGATTCCT 54 5 

M 1 1 II I II M II I II MM I MM III I M I II M M 1 1 1 1 1 1 M III II! IM I II 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 981 



Qy 


6 


Db 


442 


Qy 


66 


Db 


502 


Qy 


126 


Db 


562 


Qy 


186 


Db 


622 


Qy 


246 


Db 


682 


Qy 


306 


Db 


742 


Qy 


366 


Db 


802 


Qy 


426 


Db 


862 


Qy 


486 


Db 


922 



Qy 



54 6 GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 605 



Db 


982 


Qy 


606 


Db 


1042 


Qy 


666 


Db 


1102 


Qy 


723 


Db 


1162 


Qy 


783 


Db 


1222 


Qy 


843 


Db 


1282 


Qy 


903 


Db 


1291 


Qy 


963 


Db 


1351 


Qy 


1023 


Db 


1411 


Qy 


1083 


Db 


1471 


Qy 


1143 


Db 


1531 


Qy 


1203 


Db 


1591 


Qy 


1263 


Db 


1651 


Qy 


1323 


Db 


1711 



II MIMIMI MMMIIIMM Mill IIMIMIMIII MMMMIIMM 

GGGGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 1041 
ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

II I II I M 1 1 1 Ml M M I III II 1 1 M 1 1 1 1 M I II M I M III M MM MM I M M I 

ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 1101 
TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAAC- - -G 722 

IIMM'MIMIMII IMMIIIMMMMMI M MM MllMI I 

TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGCAG 1161 
GGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCC 782 

M 1 1 1 1 > I II 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 M 1 1 '1 1 1 1 1 IM II i MM I M 1 1 1 1 1 1 

GGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCC 1221 
TGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCC 84 2 

i 1 i i i : 1 : i i i :, 

TGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCC 1281 

GAGCCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGG 902 

M lllllll 
G A GCCCTGG 12 90 



1 1 1 1 1 1 1 1 1 '1 1 1 1 h 1 1 : 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 1 III 1 1 1 1 1 1 : 1 1 'I I i 1 1 1 



1 1 1 , 1 1 1 1 1 M I ! 1 1 ! l I ' 1 1 1 1 ' II 1 1 M II 1 1 1 1 II 1 1 III : II i II 1 II 'I I ; 1 1 1 



TCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGG 1082 

llllll IIMIIIIIIIIIIMIIIIIIII IIIIIMIIIIIIIIMII III III Mill 

TCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGG 14 70 



llllll IIMIIIIIIIIIIMIIIIIIII III llllllllllllllll III III I 



TGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCA 12 02 

ii 1111:1 ri iiiiii iii'iiiii'i r 111 ii imii Mi'iiiih ii mi iiiii 

TGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCA 1590 
ACATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCA 1262 

II M 1 1 1 M I II 1 1 1 1 1 1 M 1 1 1 1 1. 1 M II II 1 1 Ml I MM IMII I M 1 1 1 1 1 1 II 

ACATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCA 1650 
CCCGGCAGGCCCGGCCCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTG 1322 

II II I II I M 1 1 M 1 1 M I M 1 1 1 II 1 1 1 1 1 MM I Mi 1 1 1 II II M II III 1 1 1 II III 

CCCGGCAGGCCCGGCCCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTG 1710 



RESULT 4 



SSTGFBR 

LOCUS SSTGFBR 16 05 bp mRNA linear MAN 2 7 -MAR- 19 95 

DEFINITION Porcine mRNA for transforming growth factor-beta (TFG) precursor. 

ACCESSION Y00111 

VERSION Y00111.1 GI:2129 

KEYWORDS transforming growth factor-beta. 

SOURCE Sus scrofa (pig) 

ORGANISM Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetart iodactyla ; Suina; Suidae; Sus. 
REFERENCE 1 (bases 1 to 1605) 

AUTHORS Derynck,R. and Rhee,L. 

TITLE Sequence of the porcine transforming growth factor-beta precursor 

JOURNAL Nucleic Acids Res. 15 (7), 3187 (1987) 
MEDLINE 87174844 
PUBMED 3470708 
F EATURES Loca t i on/ Qua 1 i f i er s 

source 1 . .1605 

/ organi sm= " Sus scro fa " 
/mol_type=" mRNA" 
/db_xref ="taxon: 9823" 
/ 1 i s sue_t ype= " ovary " 
/clone_lib=" lambda gtlO" 
CDS 404. .1576 

/note="TFG-beta precursor (AA 1-390)" 

/codon_start=l 

/protein_id="CAA68291. 1" 

/db_xref="GI :2130" 

/ db_xr e f = " SWISS - PROT : P 0 7 2 0 0 " 

/ 1 rans 1 a t ion= " MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRK 
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMLESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR 
LSAHCSCDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS " 

mat_peptide 1238. .1573 

/product = "mature TFG-beta" 

misc_feature 647. .655 

/note="pot. N-glycosylation site" 

misc_feature 809. .817 

/note="pot. N-glycosylat ion site" 
BASE COUNT 298 a 572 c 437 g 298 t 

ORIGIN 



Query Match 83.4%; Score 1134.8; DB 4; Length 1605; 

Best Local Similarity 95.4%; Pred. No. 4.9e-196; 

Matches 1200; Conservative 0; Mismatches 7; Indels 51; Gaps 1; 
Qy 6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 

II Ml 1 1 1 II I M ! I, M 1 1 1 1 1 1 M I II 1 1 1 1, i II 1 1 1 1 M 1 1 1 1 1 1 1 l ; l I II 

Db 39 9 CCCCCATGCCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 4 58 

Qy 66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

IIIIIIIIIIIIIIMMIIIIIIIIIIIIIIMMIIIIIIIMIIIIIIIIIIIIIII 

Db 45 9 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 518 



Qy 


126 


AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 

1 ! 1 1 1 II 1 : l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ' 1 ! 1 1 ! 1 ! II: 1 1 1 1 1 1 1 Ml 1 M 1 1 II 1 

AG CTGGTGAAGCGG AAG CG CAT CG AGG C CATT CG CGG C CAGATT CTGT C CAAG CTTCGG C 


185 


Db 


519 


578 


Qy 


186 


TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 !l 1 II 1 1 1 1 1 1 1, 1 II 1 1 1 M lllllll 

TCGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGG 


245 


Db 


579 


638 


Qy 


246 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 


305 


Db 


639 


Mill MMIMI II IMIMI MM MINIM Ml Mill MINIUM MINIM 

CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 


698 


Qy 
Db 


306 
699 


CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 

IMIIIII MIMI III MINIMI MIMMMM 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 

CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGCTGGAAAGCGGCAACC 


365 
758 


Qy 


366 


AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 

' 1 1 1 i 1 II 1 II M 1 ! Ml M 1 1 1 1 1 M 1 II 1 II ! 1 1 1 M IL' II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 

AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 


425 


Db 


759 


818 


Qy 


426 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 


485 


Db 


819 


M IN INI Ml 1 M M 11 II II 1 1 1 M M INN Ml Nil Ml MINI II Nil IN 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 


878 


Qy 


486 


GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 

IINININNINNNIMMIININMIMINNIININNMIMMIN 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 


545 


Db 


879 


938 


Qy 


546 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 

MMMIMIMM MIMI IMMIIII IIIMMIMI IMIIIIIIIIIIIII 

GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 


605 


Db 


939 


998 


Qy 


606 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 

1 1 1 1 1 1 III M 1 1 1 M II 1 1 M 1 1 1 M Ml M MIM MIMM II II MM 1 III II 1 

ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 


665 


Db 


999 


1058 


Qy 
Db 


666 
1059 


TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 

1 1 1 1 1 1 Ml M 1 1 M 1 1 1 1 1 M M 1! Ml 1 1 1 1 II Ml 1 1 1 M ll 1 1 1 II 1 1 1 1 1 1 1 1 

TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 


725 
1118 


Qy 


726 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 

1 1 1 M 1 II Ml M 1 1 1 1 1 II M 1 1 M Ml 1 M 1 Ml 1 M 1 1 M 1 M 1 1 1 II 1 1 1 1 1 1 1 1 

TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 


785 


Db 


1119 


1178 


Qy 


786 


TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

NIMNMININIIINIMNMNIIIIIIMMMNMNNNININN 

TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGA- 


845 


Db 


1179 


1237 


wy 




r'PPTnnATA rra a r'Apr'TAPPr'ATArY" 1 a ^ 1 ^"TYT , ^ 1 A^ , a pts pppa TrTrTrrnr^Tpr' its 




Db 


1238 


1 f M 1 1 1 1 1 1 

GCCCTGGATA 


1247 


Qy 


906 


CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 

MM 1 1 1 II 1 1 1 M 1 1 III 1 M 1 1 1 1 1 1 II 1 II 1 1 M 1 M 1 1 1 1 1 M II 1 II 1 1 1 M 1 II 

CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 


965 


Db 


1248 


1307 


Qy 


966 


TCCGGAAGGACCTGGGCTGGAAGTGGATTC7VTGAACCCAAGGGCTACCATGCCAATTTCT 


1025 



1 1 II 1 1 1 1 , M 1 1 ! 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 , 1 1 M 1 1 1 III II 1 1 1 1 1 1 1 1! 1 1 1 1 

Db 13 08 TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCT 1367 

Qy 1026 GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 108 5 

I M 1 1 1 Ml 1 1 1 1 1 1 1 1 1 1 II 1: 1 , 1 1 1 1 1 1 1 Ml 1 1 1 1 1 J! , I 1 1 , 1 1 1 1 1 1 1: 1 

Db 1368 GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 14 27 

Qy 1086 TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 114 5 

IMMIIIIIII MIMM MMMMMIMM MMMMMMMMIIMM 

Db 1428 TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 1487 

Qy 114 6 AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 12 05 

M Ml 1 1 Ml 1 1 1 1 1 M 1 1 1 Ml M I II I Ml 1 1 I II 1 1 1 Ml M M 1 1 1 1 1 1 Ml M I 

Db 14 88 AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 1547 

Qy 1206 TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCAC 1263 

MMMIIIIMI MIMM MMMMMIMM MMMMMIMIMMM 

Db 154 8 TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCAC 1605 



RESULT 5 
AF461808 
LOGUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



gene 
CDS 



AF461808 2221 bp mRNA linear MAM 03-JAN-2002 

Sus scrofa transforming growth factor beta 1 (TGFB1) mRNA, complete 
cds . 

AF461808 

AF461808 .1 GI : 18042250 

Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetartiodactyla ; Suina; Suidae; Sus. 

1 (bases 1 to 2221) 

Wimmers,K., Chomdej,S., Ponsuksili, S . and Schellander, K. 
Polymorphism in the porcine transforming growth factor beta 1 gene 
Unpublished 

2 (bases 1 to 2221) 

Wimmers,K. , Chomdej,S., Ponsuksili , S . and Schellander, K. 
Direct Submission 

Submitted (20-DEC-2001 ) Institute of Animal Breeding Science, 
University of Bonn, Endenicher Allee 15, Bonn 53115, Germany 

Locat ion/Qua 1 if iers 

1. .2221 

/organism="Sus scrofa" 

/ mo 1 _ t yp e = " mRNA " 

/db_xref ="taxon: 9823" 

/ chromosome= " 6 " 

1. .2221 

/gene="TGFBl" 

1. .1173 

/gene="TGFBl" 

/note= 11 cytokine" 

/codon_start=l 

/product=" transforming growth factor beta 1" 
/protein_id="AAL57902 . 1 " 
/db_xref = "GI : 18042251" 

/ translat ion= "MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 



RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEWRVLMVESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR 
LSAHCSCDSKDNTLHVEINGFNSGRRGDIATIHGMNRPFLLLMATPLERAQHLHSSRH 
RRALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS " 
3'UTR 1174. .2221 

/ gene= " TGFB1 " 

BASE COUNT 504 a 616 c 649 g 452 t 

ORIGIN 



Query Match 81.8%; Score 1113.4; DB 4; Length 2221; 

Best Local Similarity 95.4%; Pred. No. 3.6e-192; 

Matches 1178; Conservative 0; Mismatches 6; Indels 51; Gaps 1; 



Qy 


n 


ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 

Ml IIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

ATGCCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 


70 


Db 


i 


60 


Qy 


71 


CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 

M II 1 1 ; 1 ! II 1 III M 1 Ml 1 1 1 II 1 1 , II m; 1 M Ml II i 1 1 1 II II !l M 1 II II 

CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 


130 


Db 


61 


120 


Qy 


131 


GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 

MMMMMMMMMMMMMMMMMMMMMMMMMMMM Ml 

GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTCGCC 


190 


Db 


121 


180 


Qy 


191 


AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 

II 1 II Ml II 1 1 II M 1 M 1 1 1 1 M 1 M 1 1 1 MM il 1 II 1 II 1 ! 1 MIMIIIIMI 

AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGGCTCTT 


250 


Db 


181 


240 


Qy 


251 


TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 

I 1 1 1 l 1 t l 1 1 1 1 1 l 1 l I I I I 1 l l l I I I I i l l 1 l l l I l I I I i I I I I I I I I I I I I ( I I I I I I 

II M II M 1 1 II M 1 M 1 M 1 II 1 1 M M 1 1 1 II 1 M 1 1 1 1 1 1 1 1 1 M II 1 II 1 1 1 1 II 1 

TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 


310 


Db 


241 


300 


Qy 


311 


GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMII 

GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 


370 


Db 


301 


360 


Qy 


371 


TATGATAAATTC^GGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 

II Ml M 1 M 1 III M 1 Ml II 1 1 1 1 II II II! 1 M MM II 1 MMM II 1 M MM 

TATGATAAATTCAAGGGC^CCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 


430 


Db 


361 


420 


Qy 


431 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 

Ml ' ' MM 'MM ; MMM 1 MMMM MM MMM 

CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 


490 


Db 


421 


480 


Qy 


491 


AAGTTAAAAGTGGAGC^GC^CGTGGAGCTATACC^GAAATACAGCAATGATTCCTGGCGC 

M II II 1 M MMM 1 1 M MM III II 1! Ill 1 M 1 II II II 1 Ml 1 II II MMMI 

AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 


550 


Db 


481 


540 


Qy 


551 


TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 

IIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIMIIIIIIIIIIIMIII 

TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 


610 


Db 


541 


600 


Qy 


611 


ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 


670 



IMIIIIIIIIIIIMIMIIIIIIIIII Mill IMIIIIIIIIIIIIIIIIIIIIIII 



Db 



601 



ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 



660 



Qy 


671 


Db 


661 


Qy 


731 


Db 


721 


Qy 


791 


Db 


781 


Qy 


851 


Db 


835 


Qy 


911 


Db 


850 


Qy 


971 


Db 


910 


Qy 


1031 


Db 


970 


Qy 


1091 


Db 


1030 


Qy 


1151 


Db 


1090 


Qy 


1211 


Db 


1150 



IMIIM 



TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 7 9 0 

MM I II II 1 1 1 1 II II M MM I II II 1 1 M M 1 1 Ml I III 1 1 1 II II 1 1 1 M I II I 

TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 78 0 
ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 850 

M 1 1 1 II 1 1, 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II, 1 1 1 1 M II 1 1 1 ll I II li I h 

ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGA 834 



851 GATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATACCAAC 910 

M 1 1 II ! M II II 11 

83 5 GCCCTGGATACCAAC 84 9 



TACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGG 

I III 1 III I ' III , II II I MM I III I II II; III ! MM I II h I II I, I Mi ! Ml I 

TACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGG 



970 



909 



AAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTG 103 0 

M M 1 1 II M I II I M I M MM M M M 1 1 M II 1 1 1 II MM 1 1 MMI Ml I M 1 1 II I 

AAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTG 969 
GGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTAC 1090 

II M 1 1 1 II [ II 1 1 1 II 1 1 1 1 1 II 1 1 1 1 II i 1 1 II II 1 1 1 1 II 1 1 11 M 1 1 II 1 1 1 1 1 M 

GGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTAC 102 9 



i r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 r 1 1 1 1 f 1 1 1 1 1 1 1 r 1 1 1 1 1 1 ! 1 1 1 1 1 r 1 1 1 1 1 



CTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATC 1210 

1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 r 1 1 1 1 1 1 1 r r 1 1 1 r 1 1 1 1 1 f 1 1 1 f i r i 

CTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTGCAACATGATC 114 9 
GTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCC 124 5 

M M M II M 1 1 1 M I MM I II M II MMI II 

GTGCGTTCCTGCAAGTGCAGCTGAGGCCCCAAGCG 1184 



RESULT 6 

DOGTGFB1A 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



DOGTGFB1A 13 69 bp mRNA linear MAM 3 0 -OCT- 19 94 

Canine transforming growth factor-beta 1 (TGFB1) mRNA, complete 
cds . 
L34956 

L34956. 1 GI : 516071 

homologue; transforming growth factor-beta 1. 
Can is familiaris (dog) 
Canis familiaris 

Eukaryota; Metazoa; Chorda ta; Crania ta ; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis. 
1 (bases 1 to 1369) 

Manning, A.M. , Auchampach, J . A. , Drong,R.F. and SI ightom, J . L . 



TITLE 

JOURNAL 
COMMENT 

FEATURES 

source 



gene 



5 1 UTR 



CDS 



mat peptide 



3 ' UTR 



BASE COUNT 
ORIGIN 



Cloning of a canine cDNA homologous to human transforming growth 
factor-beta 1 (TGFbetal) 
Unpublished (1994) 

Original source text: Canis familiaris adult jugular vein 
endothelial cDNA to mRNA . 

Locat ion/Qua 1 if iers 

1. .1369 

/organism^" Canis famil iaris" 

/mol_type=" mRNA 11 

/db_xref ="taxon: 9615" 

/cell_type=" LPS -activated" 

/tissue_type=" jugular vein endothelial" 

/dev_s tage= " adul t " 

1. .1369 

/gene="TGFBl" 

1. .57 

/gene="TGFBl" 
58. .1230 
/gene="TGFBl" 

/f unct ion= "ant i - inflammatory agent " 

/note=" precursor" 

/codon_start=l 

/product = "transforrrung growth factor-beta 1" 
/protein_id="AAA51458 .1" 
/db_xref="GI : 516072" 

/ 1 rans 1 a t i on= " MP PSGLRLLPLLLPLLRLLVLTPGRPAAGLSTCKTI DMELVKRK 
RIEAIRGQILSKLRLSSPPSQGEVPPVPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVENTNKI YEKVKKSPHSIYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKAEQHVELYQKYSNDSWRYLSNRLLAPSDTPEWLSFDVTGWRQWLSHGGEVEGFR 
LSAHCSCDSKDNTLQVDINGFSSSRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRQ 
RRALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS " 
1039. .1227 
/gene="TGFBl" 

/product=" transforming growth factor-beta 1" 
1231. .1369 
/gene=" TGFB1" 
264 a 473 c 415 g 216 t 1 others 



Query Match 73.0%; Score 993.8; DB 4; Length 1369; 

Best Local Similarity 86.7%; Pred. No. 1.9e-170; 

Matches 1150; Conservative 0; Mismatches 123; Indels 53; Gaps 



3; 



Qy 

Db 



6 C CG AG ATGG CG CCTT CGGGG CTG CGG CTCTTG C CG CTG CTG CTG C CG CTG CTGTGG CTG C 6 5 

II III I I I I II lllllllllll lllllllllllllllllllllll I I I I I 
5 3 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTGCCGCTGCTGCGGCTAC 112 



Qy 

Db 

Qy 

Db 



66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

IIMIIIIMMIMIMIIIIMIIIIIIMMIIIIIIMIIIIIIIMIIIIIIIM 

113 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 172 
12 6 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 185 

I '1 1 1 1 i , 1 1 1 1 1 1 1 il 1 1 In II 1 1 1 , 1 1 lllllllllll I ! I 1 1 ! M 1 1 1 1 1 1 1 

173 AACTGGTG AAGCGGAAGCG CAT CG AGG C CAT C CG CGGC CAGAT C CTGT C CAAG CTG CGGC 232 



Qy 



186 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 



i : 1 1 1 1 1 , 1 1 1 1 ; i 1 1 1 1 ■ 1 1 1 1 iiiiniiii iiiiiiii iiiii ii 1 1 1 1 

233 TCTCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGTGCCGCTGCCCGAGGCCGTGCTGG 



Db 


233 


TCTCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGTGCCGCTGCCCGAGGCCGTGCTGG 


292 


Qy 


246 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 


305 


Db 


293 


1 II IIIIIIII 1 1 1 1 1 II 1 1 1 1 1 1 1 II lllll II 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 
CCCTCTACAACAGCACCCGCGACCGGGTGGCGGGGGAGAGCGCCGAGCCGGAGCCCGAGC 


352 


Qy 


306 


CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 

1 1 1 1 1 1 M 1 1 II 1 II 1 1 1 1 1 1 1 II 1 1 M 1 1 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mil 


365 


Db 


353 


1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 I 1 I 1 1 1 1 

CCGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAACACCAACA 


412 


Qy 


366 


AAATCTATGATAAATTCAAGGGCACCCCCCACAGGTTATATATGCTGTTCAACACGTCGG 
llllllllll III lllll 1 II 1 II 1 1 1 llllllllll IIIIIIII II 1 


425 


Db 


413 


iiiiiiiiii iii iiiii i ii i i i i i i iiiiiiiiii iiiiiiii ii i 
AAATCTATGAGAAAGTCAAGAAAAGTCCG CACAG CATATATATG CTCTT CAACACAT CAG 


472 


Qy 


426 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 
IIIIIII lllll lllll II II II IIIIIIII IIIIIIIII IIIIIIIIIIII 


485 


Db 


473 


IIIIIII lllll lllll II II II IIIIIIII IIIIIIIII IIIIIIIIIIII 

AGCTCCGAGAAGCAGTGCCTGAGCCCGTCTTGCTCTCCCGGGCAGAGTTGCGCCTGCTGA 


532 


Qy 


486 


GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 


545 


Db 


533 


1 II 1 1 II 1 1 1 1 II II 1 1 1 1 1 1 II 1 IIIIIIII lllllllllll 1 II 1 II II 1 1 1 1 1 
i i i i i i i i i i i i i i i iiiiiiiii iiiiiiii iiiiiiiiiii iiiiiiiiiiiii 

GGCTCAAGTTAAAAGCGGAGCAGCATGTGGAGCTGTACCAGAAATATAGCAATGATTCCT 


592 


Qy 


546 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 
Illlllllllllllllllllllllllll | | | | | HI | || || lllllllllllll 
GGCGCTACCTCAGCAACCGGCTGCTGGCGCCCAGCGACACGCCAGAATGGCTGTCCTTTG 


605 


Db 


593 


652 


Qy 


606 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 

IIIIIII lllll III IIIIIIIIIIII II III M 1 lllll IIIIIII 

ATGTCACTGGAGTCGTGAGGCAGTGGCTGAGCCATGGAGGGGAAGTCGAGGGCTTTCGCC 


665 


Db 


653 


712 


Qy 


666 


TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 


725 


Db 


713 


1 1 1 1 II 1 1 1 1 1 1 II II 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II II M 1 II II 1 II 1 1 
TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTGCAAGTAGACATTAACGGGT 


772 


Qy 


726 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 
II 1 III II 1 1 M 1 1 II 1 1 i 1 1 II II 1 1 II II II II 1 1 1 II II 1 1 llllllllll 


785 


Db 


773 


iii iii iiiiiii iiiiiiiiiiiiiiiiimi iiiiiiiiiii iiiiiiiiii 
TCAGTTCCAGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGACCCTTCCTGC 


832 


Qy 


786 


TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

II 1 II II II II 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 II 1 1 1 1 1 1 II 1 1 1 1 Mill 
i i i i i i i i i i i 1 1 i i i i i i i i i i i i i 1 1 i i 1 1 1 i i i 1 1 1 i i i i i i i i 1 1 i i iiiii 

TCCTCATGGCCACCCCACTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCAGCGCCG - - 


845 


Db 


833 


890 


Qy 


846 


CCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATA 

1 II 1 II 1 II 1 


905 


Db 


891 


Miiiilll I 


901 


Qy 


906 


CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTAPATT^ACT 


965 


Db 


902 


M 1 1 11 1 1 1 1 M M 1 M 1 1 1 1 i 1 1 1 1 1 1 1 1 II 1 M 1 1 1 1 1 IMMIMIIIIMIMM 

CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTCCGGCAGCTCTACATTGACT 


961 


Qy 


966 


TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCT 


1025 


Db 


962 


MM Mill MMMMMMIMM Mill IMIIMI Mill M M MM 

T C CG CAAGG ATCTGGG CTGG AAGTGG AT C CATG AGCC CAAGGGTTACCACGCT AACTT CT 


1021 


Qy 


1026 


GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 


1085 



IMIMMMMI IIIINM MINIM Mill 1 1 II I II II 1 1 II M I II II I 



Db 



1022 GCCTGGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCC 1081 



Qy 1086 TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 114 5 

IIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIMIIMIIII 

Db 1082 TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 1141 

Qy 114 6 AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 12 05 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIMIIIMIIMIIII 1 1 1 1 

Db 1142 AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCGAACA 12 01 

Qy 12 06 TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCG-CCCACAGCCCCGCCCACC 1264 

IIMIIMII IIIIIIIMIIIIIIIIIIIIIIMIMII II III III II I 

Db 12 02 TGATCGTGCGCTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGTCCGGCAGGCCCCGCCCAC 1261 

Qy 1265 GGGCAGGCCCGGCCCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCC 1324 

lllllll MINIM M II II II I M I II IIIMIIIIIIIIIIIII M Ml 

Db 1262 CGGCAGGNCCGGCCCCGCCCCCGCCCGCTGCGCC-GGGCTGTATTTAAGGACACCGCGCC 1320 

Qy 1325 CCAAGC 1330 

III II 

Db 1321 CCAGGC 1326 



RESULT 7 

E00973 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



E00973 2527 bp RNA 

cDNA encoding human TGF-beta. 
E00973 

E00973. 1 GI :2169234 
JP 1986219395-A/l. 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 2527) 
Riku , M . A . D . D . and Debitsudo , B . G . 
NUCLEIC ACID CODED WITH TGF-BETA AND ITS 
Patent: JP 1986219395-A 1 29-SEP-1986; 
GENENTECH INC 
OS 
PN 
PD 
PF 
PR 
PI 



linear PAT 29-SEP-1997 



Craniata ; Vertebrata; Euteleostomi ; 
Catarrhmi; Hominidae; Homo. 



USE 



human 

JP 1986219395-A/l 
29-SEP-1986 

20-MAR-1986 JP 1986064661 
22-MAR-1985 US 85 715142 

RIKU MAIKERU ANDORE DERINKU, DEBITSUDO BANNOOMAN GETSUDERU 
C12P21/00,C12Nl/00,C12N5/00,C12N15/00//C12Ql/68, (C12P21/00, PC 
C12R1-.91) , 



PC 



PC 

cc 
cc 
cc 
cc 
cc 

FH 
FT 
FT 



(C12N1/00,C12R1 : 19) , ( C12N5/0 0 , C12R1 : 91 ) .. (C12N1 5 /CM , C12R1 r 91 ) ; 
strandedness : Double; 
topology: Linear; 
hypothet i ca 1 : No ; 
anti-sense: No; 

♦source: tissue_type=placenta and glyoblastoma; FH Key 
Locat ion/Qual if iers 



CDS 



842. .2014 

/product= 'pre TGF-beta' 



FT 


mat_peptide 


1676. .2011 


FT 




/product= 'TGF-beta » 


FT 


5 'UTR 


1. .841 


FT 


3 'UTR 


2015. .2537 


FT 


stem_loop 


37. .113 


FT 


stem_loop 


2015. .2100 


FT 


polyA_site 


2514. .2519. 


FEATURES 


Location/Qualifiers 


source 


1. .2527 






/organism=" 


Homo sapiens" 




/mol_type=" 


genomic RNA" 




/db xref = M taxon: 9606" 


BASE COUNT 


472 a 888 c 


735 g 432 t 


ORIGIN 






Query Match 


72.2%; 


Score 982.8; DB 6 


Best Local Similarity 85.1%; 


Pred* No. 1.8e-168 


Matches 1144; 


Conservative 


0; Mismatches 147, 



2; 



Qy 


6 


CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 

II III MM II llllllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 


65 


Db 


837 


896 


Qy 


66 


TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 I 1 1 1 1 1 1 1 ! 


125 


Db 


897 


1 1 M 1 1 1 1 II M II 1 M 1 II II 1 1 II 1 1 II 1 II 1 1 II II 1 II 1 1 1 1 II 1 II II 1 1 

TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 


956 


Qy 


126 


AGCTGGTGAAGCGGAAGCGCATCGAGGCC7VTTCGCGGCCAGATTCTGTCCAAGCTTCGGC 


185 


Db 


957 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 MMIIIMM lllllllllll MM 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 


1016 


Qy 


186 


TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 


245 


Db 


1017 


1 M M II 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 IIIIIIIIIIIIIIIHIII Mill 1 1 II 1 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 


1076 


Qy 


246 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 


305 


Db 


1077 


1 M MMMM MIMMIIMIII IMIMM MM 1 1 1 1 II 1 1 1 1 II 1 M 1 

CCCTCTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 


1136 


Qy 


306 


CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 


365 


Db 


1137 


1 Mill MMIMIMIIMIIMIIIMIIIMMIIIIIMMIIM 1 MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 


1196 


Qy 


366 


AAATCTATGATAAATTCAAGGGCACCCCCCAC^GCTTATATATGCTGTTCAACACGTCGG 


425 


Db 


1197 


MMMMII M MMM 1 1 MMM MMMM 1 MMMM M 1 

AAATCTATGACAAGTTCAAGCAGAGTACACACAGC^ 


1256 


Qy 


426 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTG CTCTPTrGGGCAGAGCTG CG CCTG CTGA 

MIMM MMMM M Mill II MMMM IIIMMIIIIMI MMIM 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 


a n r- 

-x O -J 


Db 


1257 


1316 


Qy 


486 


GGCTCAAGTTAAAAGTGGAGC^G(^CGTGGAGCTATACC^GAAATACAGa^TGATTCCT 


545 


Db 


1317 


MIMMMMMMMMMMIMMMMM IMMMMMMMM MMM 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATAC^^ 


1376 


Qy 


546 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 


605 



Mil llllllll MMMM! II I Ml Mill Mill II I II II I I II MM 



Db 1377 GGCGATACCTCAGC^ACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 1436 

Qy 606 ATGTCACGGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

lllllllilllllllillillllll ill ill Ml II II Mill lllllll 

Db 14 37 ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 14 96 

Qy 666 TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 725 

I I I llllllll Illllllllllll lllllllllll I I Mill M lllllll 
Db 14 97 TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 1556 

Qy 726 TCAATTCTGGCCGGCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 785 

III I I MMMM I M 1 1 f 1 1 1 1 1 1 1 1 1 1 1 II I MMMMMMM lllllll 

Db 1557 TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1616 

Qy 786 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I 1 1 M M 1 1 M i I ( 1 1 1 M 1 1 1 1 i I II M II II Mill M 1 1 1 1 M II M 1 1 1 1 1 1 

Db 1617 TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 1675 

Qy 84 6 CCCTGGATACCAACAGGTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATA 905 

MMMM I 

Db 1676 GCCCTGGACA 1685 

Qy 906 CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 965 

lllllll MMMIIIMMIMIIMIIIMIIIIIIIMMIMM IMMMMI 

Db 1686 CC^CTATTGCTT(^GCTC(^CGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACT 174 5 

Qy 966 TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCA.TGCCAATTTCT 1025 

MM MMMM MMMMMMM M II llllllllllllllllllll MM 

Db 1746 TCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 18 05 

Qy 1026 GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 1085 

I II I MMMM MMMM MMMM Mill 11 II II 1 1 1 II I II II II 1 1 I 

Db 18 06 GCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCC 1865 

Qy 1086 TGTACAACCAGC^CAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 114 5 

Illllllllllll lllllllllll I ( I ( 1 1 1 1 1 1 1 1 ( 1 1 1 1 1 { 1 1 1 1 f M 1 1 1 1 1 1 1 1 

Db 1866 TGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 1925 

Qy 114 6 AGCCACTGCCC^TCGTGTACTACGTGGGCCGO^GCCCAAGGTGGAGCAGCTGTCCAACA 12 05 

MM I II i ! 1 1 1 ! i ! , I ! ! i 1 1 1 1 ! 1 1 i I 1 1 ! 1 1 1 1 ! 1 1 ! 1 1 1 , 1 llllllllllll 

Db 1926 AGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGGAGCTGTCCAACA 1985 

Qy 12 06 TGATCGTGCGTTCCTG(^GTG(^GCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCC 12 65 

IMMMMI IIIIMIIIIMIIIIIMI MMIMMM I MMMMI II 

Db 1986 TGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCC- - CCGCCCCGCCCGGCC 2043 

Qy 1266 GGCAGGCCCGGCCCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCC 132 5 

II IMIIMM ! !! !! M M M ! I ! M 1 1 M II I M II II 

Db 2044 CCACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCC 2103 

Qy 1326 CAAGCCCACTTGGGATCGATTAAA 13 4 9 

MMMM Ml I I II 

Db 2104 CAAGCCCACCTGGGGCCCCATTAA 2127 
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A06669 2537 bp mRNA 

Synthetic mRNA for preTGF-Betal . 
A06669 

A06669. 1 GI :412940 

synthetic construct 
synthetic construct 
artificial sequences . 
1 (bases 1 to 2537) 



NUCLEIC ACID ENCODING TGF- beta 3 AND ITS USE 



linear PAT 29-JUL-1993 



BASE COUNT 
ORIGIN 



Patent: WO 8912101-A 4 14 -DEC- 198 9 ; 
Location/Qualifiers 
1. .2537 

/organism= " synthetic construct " 

/mol_type=" mRNA " 

/db_xref = M taxon : 3263 0 " 

842. .2014 

/codon_start=l 

/ trans l_table= 11 

/product ="preTGF-betal" 

/protein_id="CAA00588 .1" 

/db_xref="GI : 412 941" 

/ trans la tion="MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNE I YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFR 
LSAHCSCDSRDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS " 
473 a 893 c 739 g 432 t 



Query Match 72.2%; 
Best Local Similarity 84.9%; 
Matches 1148; Conservative 



Score 982; DB 6; Length 2537; 
Pred. No. 2.5e-168; 
0; Mismatches 145; Indels 59; 



Gaps 



2; 



Qy 

Db 



6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 5 

II Ml I I I I II lllllllllll lllllllllllll II I I I I I I I I: I M I 
83 7 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 



QY 
Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 



66 



897 



126 



957 



TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 

I 1 1 1 i , 1 1 1 1 1 1 1 1 1 1 1 MINIMI Mill I J 1 1 1 1 1 1 j 1 1 1 1 1 IMIIIIIII 

TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 



125 



956 



185 



AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 

1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 II lllllllllll MINIMI MM 

AG CTGGTG AAG CGG AAG CG CAT CGAGG C CAT CCG CGG C CAG AT C CTGTC CAAG CTG CGGC 1016 



18 6 TTG CCAG C CCCC CGAG CCAGGGGGACGTG CCG CC CGGC CCG CTG CCTGAGG CAG TACTGG 24 5 

I 1 1 M 1 1 1 1 ! 1 1 1 M 1 1 1 1 M I M IMIMIMIIMMIMI Mill II II I 

1017 TCG C CAG C C CCC CGAG C CAGGGGGAGGTGC CG C CCGG CCCG CTG C C CGAGG CCG TG CT CG 1076 



246 



305 



CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 

I II 1 1 1 1 1 1 1 1 MMMIMIMM Mill M MM MMI Ml! Mill 

1077 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 113 6 



Qy 


306 


CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 

1 Mill Mill M Mill II MM Ml M III II MMM MMM II 1 1 MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 


365 


Db 


1137 


1196 


Qy 


366 


AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 


425 


Db 


1197 


hi Ml 1 M Mill 1 1 MMM MMMM 1 llllllll II 1 

AAATCTATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAG 


1256 


Qy 


426 


AG CT C CGGG AAG CG G TG CCGG AA CCTGTA TTG CT CT CT CGGG CAG AG CTG CG C CTG CTG A 


485 


Db 


1257 


MMMI MMMM M IMM M MMMM IMMMMMMI MMIM 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 


1316 


Qy 


486 


GG CTCAAGTT AAAAG TGG AG CAG CACGTGG AG CT AT ACCAG AAATACAG CAATG ATTCCT 

1 1 f 1 ! 1 1 1 i 1 1 1 1 1 1 1 1 1 1 f 1 1 1 J 1 f i 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 J 1 1 1 1 1 MMM 

GGCT(^GTTAAAAGTGGAG(^GCACGTGGAGCTGTACCAGAAATACAGCAA(^TTCCT 


545 


Db 


1317 


1376 


Qy 


546 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 


605 


Db 


1377 


MM MMMMIMMMIMIMM Mill IMM II MMM 1 M MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 


1436 


Qy 


606 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 


665 


Db 


1437 


MMMMMMMMMMMMM Ml Ml Ml II M IMM MMMI 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 


1496 


Qy 


666 


TCAGTGCC(^CTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 


725 


Db 


1497 


1 M MMMM lllllllllllll lllllllllll II IMM II MMMI 

TTAGCGCCCACTGCTCCTGTGAC^GC^GGGATAA(^C^CTG(^GTGGACATC^CGGGT 


1556 


Qy 


726 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 

III 1 1 MMMM IIIIIIIIIIMIIIIIIII IIIMIIIIIIIII lllllll 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 


785 


Db 


1557 


1616 


Qy 


786 


TCCTC^TGGCOICCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

1 II 1 M M II 1 M II 1 1 M M |[| | i 1 1 1| 1 1! |||| || 1 II 1 1 M II 1 1 1 1 1 II 

TTCTC^TGGCO^CCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 


845 


Db 


1617 


1675 


Qy 


846 


CCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATA 


905 


Db 


1676 


MMMM 1 

GCCCTGGACA 


1685 


Qy 


906 


CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 

MMMI 1 M II M III 1 M MM 1 1 II II 1 II 1 III MM M Ml 1 IMIMMII 

CCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACT 


965 


Db 


1686 


1745 


Qy 


966 


TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCT 

MM MMMM MMIIIMIIIII M II 1 II II 1 II II 1 1 1 II 1 1 1 1 1 MM 

TCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 


1025 


Db 


1746 


1805 


uy 


1 Pi o c 


LjLL. 1 LjLjULiLL-L 1 Li 1 LLL 1 ALA 1L1 LjOAOLL I ALiACJAL I UAb I ALAOLAALjCj I LL I LjGLT. L 


1 P o c 

108 5 


Db 


1806 


MM MMMM MMMM MMMM Mill 1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 II 1 1 

GCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCC 


1865 


Qy 


1086 


TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 

1 M II M 1 1 1 II 1 IMIMMIM M Ml 1 M 1 M 1 IM 1 Ml 1 1 M II M MM III 

TGTACAAC CAG CATAACC CGGG CGCCTCGGCGGCGCCGTG CTG CGTGCCGCAGGCGCTGG 


1145 


Db 


1866 


1925 


Qy 


1146 


AGCC^CTGCCCATCGTGTACTACGTGGGCCGC^GCCCAAGGTGGAGCAGCTGTCCAACA 


1205 



1 1 1 1 I MM I llllllll IM Illllll II 1 1 III I !MII ! II I II III II Mill I 

Db 1926 AGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 198 5 

Qy 12 06 TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCC 1265 

MMMMM MMMMMIMMMMI MMMMMII M I M I 

Db 1986 TGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAG 2 04 5 
Qy 12 66 GGCAGGCCCGGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA 1317 

I I MM I MM MM Ml M M MMMMMMMMMI 

Db 2 04 6 GCCCGGCCCCACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACA 2105 

Qy 1318 T CG TG CC C CAAG C CCA CTTGGG AT CGATTAAA 134 9 

MMMMMMMM MM I I 1 1 

Db 2106 CCGTGCCCCAAGCCCACCTGGGGCCCCATTAA 213 7 



RESULT 9 

OATGFB1 

LOCUS 

DEFINITION 
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KEYWORDS 
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REFERENCE 
AUTHORS 
TITLE 
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MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 
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CDS 



OATGFB1 1173 bp mRNA linear MAM 18-APR-1995 

O.aries mRNA for transforming growth factor-beta I. 

X76916 

X76916.1 GI :496648 

TGF-beta 1; transforming growth factor-beta 1. 
Ovis aries (sheep) 
Ovis aries 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetartiodactyla ; Ruminantia; Pecora; Bovoidea; 
Bovidae; Caprinae; Ovis. 
1 

Woodall , C . J . , McLaren,L.j. and Watt, N.J. 

Sequence and chromosomal localisation of the gene encoding ovine 

latent transforming growth factor-beta 1 

Gene 150 (2) , 371-373 (1994) 

95121932 

7821809 

2 (bases 1 to 1173) 

Woodall, C. 

Direct Submission 

Submitted (24 -DEC- 1993 ) C. Woodall, Univ. of Edinburgh, Dept. of 
Veterinary Pathology, Sc. of Vet. Studies, Univ. of Edinburgh, 
Edinburgh EH 9 IQH, UK 

Locat ion/Qua 1 if iers 

1. .1173 

/organism="Ovis aries" 
/ mo 1 _t yp e = " mRNA " 
/db_xref ="taxon: 9940" 
1. .1173 
/codon_start=l 

/product =" transforming growth factor-beta 1" 
/protein_id="CAA54242 . 1" 
/db_xref="GI : 4 9664 9" 
/db_xref ="SWISS-PROT: P50414" 

/ trans la t ion= "MPPSGLRLLPLLLPLLWLLMLTPGRPVAGLSTCKTI DMELVKRK 
GIEAIRGQILSKLRLASPPSQGDVPPGPLPEAILALYNSTRDRVAGESAETEPEPEAD 
YYAKEVTRVLMVEYGNKI YDKMKSSSHSI YMFFNTSELREAVPEPVLLSRADVRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTHREEIEGFR 
LS AHCS CDS KDNTLQ VD I NG F S SGRRGDLAT I HGMNR PFLLLMAT PLERAQHLHS S RH 



RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 
matjpeptide 835. .1170 

/product =" transforming growth factor-beta 1" 

BASE COUNT 245 a 378 C 336 g 214 t 

ORIGIN 

Query Match 71.7%; Score 976; DB 4; Length 1173; 

Best Local Similarity 88.9%; Pred . No. 3.3e-167 ; 

Matches 1088; Conservative 0; Mismatches 85; Indels 51; Gaps 1; 

ATGG CG CCTTCGGGG CTG CGG CT CTTGC CG CTG CTG CTG CCG CTG CTGTGG CTG CTAGTG 70 

III I I Ml, Mill II II 1 1 1 1 1 1 1 M I II I il . 1 1 1 1 1! 1 1 1 . Ill II 

ATGCCGCCTTCGGGGCTGCGGCTGCTGCCGCTGCTGCTGCCGCTGCTGTGGCTACTAATG 6 0 

CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 13 0 

Ml IMMIM NMM "MMi MM MIMMII 

CTGACGCCTGGCCGGCCGGTCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 190 

llllllllllll MM Mill Mill 1 1 1 1 1 1 MUM MINIM Ml 

GTGAAGCGGAAGGGCATCGAGGCCATCCGCGGTCAGATTTTGTCCAAACTTCGGCTCGCC 18 0 
AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 25 0 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMIMIMI III Mill I Mill III 

AGTCCCCCGAGCCAGGGGGACGTGCCACCCGGCCCGCTGCCCGAGGCCATACTGGCCCTT 24 0 
TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 310 

I MM II I M 1 1 Ml II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I r 1 1 1 1 1 I i [ 1 1 1 1 1 1 

TACAACAGTACCCGCGACCGGGTGGCCGGGGAAAGTGCCGAAACGGAGCCTGAGCCAGAG 3 00 
GCGGACTACTACGCC^GGAGGTC^CCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 3 7 0 

1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I Mlllll Mill 

GCGGACTACTACGCC^GGAGGTC^CCCGCGTGCTAATGGTGGAATACGGC^C^AAATC 360 
TATGATAAATTC^GGG(^CCCCCC^(^GCTTATATATGCTGTTC^ACACGTCGGAGCTC 4 3 0 

Mill III I Ml III MIMI MINIM I IIIIIIIINI MINI 

TATGAOW^TGAAGTCTAGCTCGC^OVGC^TATATATGTTCTTC^a^CGTCCGAGCTC 42 0 
CGGGAAG CGGTG C CGGAA CCTGTATTG CT CT CT CGGG CAGAG CTG CG CCTG CTG AGG CTC 4 9 0 

iiiiiiii inn 1 1 1 1 1 1 1 1 i r 1 1 1 1 1 f r 1 1 1 1 f 1 1 1 iiiiiiiiiiiiiiiii 

CGGGAAGCAGTGCCTGAACCTGTGTTGCTCTCTCGGGCAGACGTGCGCCTGCTGAGGCTC 4 8 0 

AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 55 0 

I I I I N I II I II II I I II I II II I I I I N II I I II II II I II II I I I I I i I I I I I I 
AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATATAGCAACAATTCCTGGCGC 54 0 

TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 610 

I M M 1 1 M I II MM I M IIIIINI I N I II II II I II 1 1 II 1 1 II 1 1 1 II III 

TACCTCAGCAACCGGCTGCTCGCCCCCAGCGACTCACCGGAGTGGCTGTCCTTTGACGTC 6 0 0 
ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 67 0 

II M I M 1 1 II MM I III I II III 1 1 1 1 1 M I INN II 1 1 i 1 1 1 1 1 ! M I 

ACTGGAGTTGTGCGGCAGTGGCTGACCCACAGAGAGGAAATAGAAGGCTTTCGCCTCAGT 66 0 
GCCCACTGTTCCTGTGAC^GOW^GATAAC^CACTCC^CGTGGAAATTAACGGGTTCAAT 730 

II M 1 1 1 II 1 1 1 1 MM 1 1 II IIIIIIII II II INN II IMINIIM I 



Qy 


n 


Db 


l 


Qy 


71 


Db 


61 


Qy 


131 


Db 


121 


Qy 


191 


Db 


181 


Qy 


251 


Db 


241 


Qy 


311 


Db 


301 


Qy 


371 


Db 


361 


Qy 


431 


Db 


421 


Qy 


491 


Db 


481 


Qy 


551 


Db 


541 


Qy 


611 


Db 


601 


Qy 


671 



Db 



661 GCCCACTGTTCCTGTGACAGTAAGGATAACACGCTTCAAGTGGACATCAACGGGTTCAGT 72 0 



Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



731 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 790 

II I M 1 I I I 1 I I M I I I I 1 II II I I I I I I I I I I I II I I I I I I I I II I II I II I I I II I 
721 TCCGGCCGCCGGGGTGACCTCGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 78 0 



791 



850 



ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 

1 1 [ I I [ [ I 1 1 I 1 1 1 ! I ! I ! 1 1 M 1 1 1 1 1 1 1 1 1 1 1 ! I M II I , I lllllllll 

781 ATGGCCACCCCTCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGCCACCGCCGA 834 

851 GATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATACCAAC 910 

IIIIMII llllll 

835 GCCCTGGACACCAAC 84 9 

911 TACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGG 970 

MMIIIIIMMIMI II MIUIMMI II I! 1 1 1 1 1 1 II 1 1 II I M 1 1 1 II I 

8 50 TACTGCTTCAGCTCCACAGAAAAGAACTGCTGTGTTCGTCAGCTCTACATTGACTTCCGG 909 
971 AAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTG 103 0 

iiiiMiiiiiiiiiiiiiiiiiiii 1 1 1 1 1 1 ri ii in i i i ! ii ii 1 1 : i I I I I I 

910 AAGGACCTGGGGTGGAAGTGGATTCACGAACCCAAGGGCTACCACGCCAATTTCTGCCTG 96 9 
1031 GGGCCCTGTCCCTACATCTGGAGCCTAGACACTC^GTACAGCAAGGTCCTGGCTCTGTAC 1090 

llllllllllllllllllllllll Mill 1 1 1 il I II 1 1 ! 1 1 1 1 II I . MINI 

970 GGGCCCTGTCCCTACATCTGGAGCCTGGACACACAGTACAGCAAGGTCCTGGCCCTGTAC 102 9 
1091 AACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCA 115 0 

M It II M II 1 M 1 1 M 1 1 1 1 1 1 1 M ! I M 1 1 1 : 1 ! I M 1 1 1 1 IMIIMMII II 

103 0 AACCAGCACAACCCGGGCGCATCGGCGGCGCCGTGCTGCGTGCCTCAGGCGCTGGAACCC 108 9 
1151 CTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATC 1210 

1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 II I L Ml' 1 1 II 1 1 1 1 II I II M ! 1 1 1 II 1 1 1 1 : 1 1 1 1 

1090 CTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGTTGTCCAACATGATC 114 9 
1211 GTGCGTTCCTGCAAGTGCAGCTGA 1234 

Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 

1150 GTGCGCTCCTGCAAGTGCAGCTGA 1173 



RESULT 10 

BC000125 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



BC000125 1780 bp mRNA linear PRI 12-JUL-2001 

Homo sapiens, Similar to transforming growth factor, beta 1, clone 
MGC:3119 IMAGE : 33 5 1664 , mRNA, complete cds . 
BC000125 

BC00 0125.1 GI : 12 65274 8 
MGC. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Honunidae; Homo. 
1 (bases 1 to 1780) 
Strausberg, R . 
Direct Submission 

Submitted ( 03 -NOV-2000 ) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 



USA 

REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov 
COMMENT Contact: MGC help desk 

Ema 1 1 : cgapbs - r@ma i 1 . nih . gov 
Tissue Procurement: ATCC 

cDNA Library Preparation: Rubin Laboratory 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Genome Sequence Centre, 

BC Cancer Agency, Vancouver, BC, Canada 

inf o@bcgsc . be . ca 

Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield, 
Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin, 
Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo 
Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven 
Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline 
Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott, 
Michael Thome, Miranada Tsai, Natasja van den Bosch, Jill Vardy, 
George Yang, Scott Zuyderduyn, Marco Marra. 



FEATURES 

source 



CDS 



BASE COUNT 
ORIGIN 



Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consort ium/LLNL at: http://image.llnl.gov 
Series: IRAL Plate: 6 Row: e Column: 11 

This clone was selected for full length sequencing because it 
passed the following selection criteria: Hexamer frequency ORF 
analysis . 

Loca t ion/ Qual i f iers 
1. .1780 

/organism="Homo sapiens" 

/mol_type= " mRNA" 

/db_xref ="taxon: 9606" 

/clone="MGC:3119 IMAGE : 335 1664 •* 

/tissue_type= n Eye, retinoblastoma" 

/ c 1 one_l ib= " N I H_MGC_ 1 6 " 

/lab_host="DH10B-R" 

/note= "Vector : pOTB7 " 

447. .1619 

/codon_start=l 

/product=" Similar to transforming growth factor, beta 1" 
/protein_id="AAH00125 . 1" 
/db_xref="GI : 12652749" 

/trans lation= "MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNEI YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFR 
LSAHCSCDSRDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNM I VRSCKCS " 
341 a 648 c 493 g 298 t 



Query Match 71.7%; 
Best Local Similarity 84.6%; 
Matches 1144; Conservat ive 



Score 975.6; DB 9; 
Pred. No. 3.7e-167; 
0; Mismatches 149; 



Length 1780; 
Indels 59; Gaps 



2; 



Qy 6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 

II III I I II II lllllllllll III lllllllll I, I II I II I I I II I I 
Db 44 2 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCTGCTGCTGCTACCGCTGCTGTGGCTAC 501 



Qy 66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

I IIIIIIIMIIIIIIIIIIIIIIMI Mill IIIIIIIIIIIIII IIIIIIIIII 

Db 502 TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 561 

Qy 126 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 

MMIMMMMMIMIMIIMIIMM M 1 1 f 1 1 1 1 1 1 lllllllllll MM 

Db 562 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 621 

Qy 18 6 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I MMMMMMMMIMMM MMIMMMMMMMI Mill II M I 

Db 622 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 681 

Qy 24 6 CTCTTTACAACAGTACGCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I M llllllll MIIIMIIIIIII llllllll MM IMIIIIIIIIIIIII 

Db 682 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 741 

Qy 3 06 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 3 65 

I Mill IIIIIIIIIIMIIIIIIIMIIIIIIIIIIIIMIMMIM I MM 

Db 74 2 CTGAGGCCGACTACTACG CCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAA CG 801 

Qy 3 66 AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 5 

I M M II II I II MUM I I MUM llllllll I llllllll M I 

Db 802 AAATCTATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAG 861 

Qy 42 6 AG CT C CGGG AAG CGGTG CCGGAA C CTGTATTG CT CT CT CG GG CAGAG CTG CG CCTG CTGA 48 5 

lllllll llllllll II Mill II MUM IMMIIIIIIIII lllllll 

Db 862 AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 921 

Qy 4 86 GGCTCAAGTTAAAAGTGGAGO^GCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 ] 1 1 1 1 It 1 1 1 1 1 1 1 1 i I i M 1 1 1 1 1 1 1 1 1 ) 1 1 1 1 1 1 1 MUM 

Db 922 GGCTC^GTTAAAAGTGGAGCAGC^CGTGGAGCTGTACC^GAAATACAGCAACAATTCCT 981 

Qy 54 6 GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 6 05 

MM MMMMMMMMIMMM Mill Mill II IMIM I II MM 

Db 982 GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 1041 

Qy 606 ATGTCAC CGGAGTTGTGCGG CAGTGG CTGACCCG CAGAGAGGCTATAGAGGGTTTTCGCC 665 

IMMIIIIMIIIIMIIIIIIIII III III III 1 1 II Mill lllllll 

Db 1042 ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 1101 

Qy 666 TCAGTGCCC^CTGTTCCTGTGACAGCAA^ 725 

I II MIMIII MIIIMIIMM IMMIIMM II Mill II MIMM 

Db 1102 TTAGCGCCCACTGCTCCTGTGA(^GC^GGGATAA(^(^CTG(^GTGGACATa^ACGGGT 1161 

Qy 72 6 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 785 

III I I llllllll I I I I I I I 1 I f I I I I I I I I I I IIIMIIMIIMI lllllll 

Db 1162 TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1221 

Qy 786 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I II M II II II II II II I II 1 1 1 II I M II II I Mill IMMIIIMIIMMII 

Db 1222 TTCTC^TGGCOVCCCCGCTGGAGAGGGCCC^GC^TCTGa^AGCTCCCGGCACCGCCGA- 128 0 

Qy 846 CCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATA 9 05 

MINIM I 

Db 1281 GCCCTGGACA 12 90 



Qy 


906 


CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 


965 


Db 


1291 


Mill M 1: 1 1 M II 1 1, 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 ' M 1 IN 1 1 1 'MINIM 

CCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACT 


1350 


Qy 


966 


TCCGGAAGGACCTGGGCTGGAAGTGGATTC^TGAACCCAAGGGCTACCATGCCAATTTCT 


1025 


Db 


1351 


MM MINIM MMMMMMM 1 1 M MMMMMMMMMM MM 

TCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 


1410 


Qy 


1026 


GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 


1085 


Db 


1411 


MM IMMMI MMMM MMMM MMI MMMMMMMMMM 1 

GCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCC 


1470 


Qy 


1086 


TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 

MMMMMMI MMMMMI M II M M M M M M M M M M M M M M M 

TGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 


1145 


Db 


1471 


1530 


Qy 


1146 


AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACZA 


1205 


Db 


1531 


MM 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 . 1 1 1 ! 1 1 1 1 1 1 1 1 1 M 1 1 1 i 1! 1 1 i 1 1 1 M ! 1 1 : 1 1 

AGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 


1590 


Qy 


1206 


TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCC 


1265 


Db 


1591 


MMMM MMMMMMMMMM MMIMIMM M 1 II 1 

TGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAG 


1650 


Qy 


1266 


GGCAGGCCCGGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA 


1317 


Db 


1651 


1 1 Mill MM MM III M II MINI MM IIIIIMII 

GCCCGGCCCCACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACA 


1710 


Qy 


1318 


TCGTGCCCCAAGCCCACTTGGGATCGATTAAA 134 9 




Db 


1711 


1 MMMMMI MM 1 1 II 

CCCGTGCCCAAGCCCACCTGGGGCCCCATTAA 174 2 





RESULT 11 

BC001180 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REMARK 
COMMENT 



BC001180 1780 bp mRNA linear PRI 12-JUL-2001 

Homo sapiens, Similar to transforming growth factor, beta 1, clone 
MGC:2323 IMAGE : 33 56605 , mRNA, complete cds . 
BC001180 

BC001180. 1 GI : 12654682 
MGC . 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1780) 
Strausberg, R . 
Direct Submission 

Submitted ( 11 -DEC-2000 ) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

NIH-MGC Project URL: http://mgc.nci.nih.gov 
Contact : MGC help desk 
Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: ATCC 

cDNA Library Preparation: Rubin Laboratory 



cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Genome Sequence Centre, 
BC Cancer Agency, Vancouver, BC, Canada 
inf o@bcgsc . be . ca 

Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield, 
Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin, 
Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo 
Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven 
Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline 
Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott, 
Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy, 
George Yang, Scott Zuyderduyn, Marco Marra . 



FEATURES 

source 



CDS 



BASE COUNT 
ORIGIN 



Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consort ium/LLNL at: http://image.llnl.gov 
Series: IRAL Plate: 6 Row: e Column: 10 

This clone was selected for full length sequencing because it 
passed the following selection criteria: matched mRNA gi : 37097. 

Locat ion/Qua 1 i f iers 

1. .1780 

/organism="Homo sapiens" 

/mol_type= " mRNA " 

/db_xref ="taxon: 9606" 

/clone="MGC:2323 IMAGE : 3356605 " 

/ 1 issue_type= " Eye , ret inoblas toma " 

/clone_lib= fI NIH_MGC_16" 

/lab_host= H DH10B-R" 

/note= "Vector : pOTB7" 

447. .1619 

/codon_start=l 

/product="Similar to transforming growth factor, beta 1" 
/protein_id= "AAHO 1 18 0.1" 
/db_xref="GI : 12654683" 

/ translat ion= "MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNEI YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFR 
LSAHCSCDSRDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS " 
341 a 648 c 493 g 298 t 



Query Match 71.7%; 
Best Local Similarity 84.6%; 
Matches 1144; Conservative 



Score 975.6; DB 9 
Pred. No. 3.7e-167 
0; Mismatches 149 



Length 1780; 
Indels 59; Gaps 



2; 



Qy 



Db 



6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 5 

II III I I I I II lllllllllll III lllllllll llllllllllllll I 
4 4 2 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCTGCTGCTGCTACCGCTGCTGTGGCTAC 5 01 



Qy 



Db 



66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 125 

I I I I ! I I !l I II I.'! I I I I I I I I I M Mill I I I I 1 I I I I | [ I f I llilllllll 
5 02 TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 561 



Qy 126 AG CTGGTGAAG CGGAAG CG CAT CGAGG CCATT CG CGG C CAGATT CTGTC CAAG CTT CGG C 18 5 

I M 1 1 1 1 1 1 1 1 'I I II 1 , 1 1 1 1 1, II I II 1 1 lllllllllll lllllllllll I 1 1 1 



Db 


562 


AG CTGGTGAAG CGG AAG CG CAT CGAGG CCAT CCG CGG CCAGAT C CTGTC CAAG CTG CGG C 


621 


Qy 


186 


TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 


245 


Db 


622 


1 IMMMMIMMIMIIMM 1 II II II 1 M 1 1 1 1 1 II 1 1 1 Mill 11 II 1 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 


681 


Qy 


246 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 

1 II IMMMI MMMMMMM MMMM MM MMMMMMIMI 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 


305 


Db 


682 


741 


Qy 


306 


CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 


365 


Db 


742 


1 MMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 


801 


Qy 


366 


AAATCTATGATAAATTC?UVGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 


425 


Db 


802 


MMIMIM f 1 MMM 1 1 MMM IMMMI 1 IMMMI M 1 

AAATCTATGACAAGTTCAAGCAGAGTACAC^ 


861 


Qy 


426 


AG CT CCGGG AAG CGGTG C CGG AAC CTGTATTG CT CTCT CGGG CAGAG CTG CG C CTG CTGA 

MIMM MMMM II Mill II MMMM 1 1 1 1 1 M 1 1 1 1 1 1 1 MIMM 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 


485 


Db 


862 


921 


Qy 


486 


GGCTC^GTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 

1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 M i 1 1 1 1 1 1 1 1 1 1 1 1 1 . 1 1 1 1 : 1 1 ! 1 1 MMM 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 


545 


Db 


922 


981 


Qy 
Db 


546 
982 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 

MM MMIMMMMMMIIMM Mill DIM II MMM 1 II MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 


605 
1041 


Qy 


606 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 

IIIIIIIIMIIIIIIIIIIMIIII Ml Ml Ml II II Mill MIMM 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 


665 


Db 


1042 


1101 


Qy 
Db 


666 
1102 


TC^GTGCCC^CTGTTCCTGTGACAGOLAAGATAA 

1 II IMMMI 1 1 1 1 1 1 f 1 1 1 1 1 1 1 II Mill II MIMM 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAAC^ 


725 
1161 


Qy 


726 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 

Ml 1 1 MMMM IIIIIIIIIIIMIIIIIII 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MIMM 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 


785 


Db 


1162 


1221 


Qy 

Db 


786 
1222 


TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

1 1 1 1 1 1 1 1 1 1 1 1 1 , 1 ! 1 1 1 1 1 1 ! 1 1 1 1 ' 1 1 1 1 Mill MIMIMMIMIMM 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 


845 
1280 


Qy 


846 


CCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATA 


905 


DO 


1 O O 1 


MMMM 1 


12 90 


Qy 

Db 


906 
1291 


CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 

MIMM III IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIII llllllllll 

CCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACT 


965 
1350 


Qy 
Db 


966 
1351 


TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCT 

MM IMMMI MMMMMMM M II II 1 1 1 1 II 1 1 1 1 II 1 II 1 1 1 MM 

TCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 


1025 
1410 



Qy 1026 GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 108 5 

1 1 1 1 MINIM MM M MMMM MMI MMMMMMIMMMI I 

Db 1411 GCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCC 14 7 0 

Qy 108 6 TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 114 5 

MMMMMMI MMMMMI M M M M M M M M M M M 1 1 1 1 1 1 1 1 M M 

Db 14 71 TGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 153 0 

Qy 114 6 AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 12 05 

MM MMMMMMMMMMMMMMMMMMMMMMMMMMMI 

Db 1531 AGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 15 90 

Qy 12 06 TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCC 126 5 

IMMMMI MMMMMMMMMM MMMMMM M I 1 1 I 

Db 1591 TGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAG 1650 
Qy 12 66 GGCAGGCCCGGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA 1317 

I I MMI MM MM Ml II II 1 1 1 M 1 1 II M I M I M 1 1 

Db 1651 GCCCGGCCCCACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACA 1710 

Qy 1318 TCGTGCCCCAAGCCCACTTGGGATCGATTAAA 134 9 

I IMMMMI Mill I II 

Db 1711 CCCGTGCCCAAGCCCACCTGGGGCCCCATTAA 1742 



RESULT 12 

AGMTGFB 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 

FEATURES 

source 



CDS 



AGMTGFB 1561 bp mRNA linear PRI 27-APR-1993 

Simian transforming growth factor-beta (TGF) mRNA, complete cds . 
M16658 

M16658 .1 GI : 176552 

growth factor; transforming growth factor-beta. 
Cercopithecus aethiops (African green monkey) 
Cercopithecus aethiops 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae,- 
Cercopithecinae; Cercopithecus . 
1 (bases 1 to 1561) 

Sharpies, K., Plowman, G.D. , Rose,T.M., Twardzik, D . R . and 
Purchio, A. F. 

Cloning and sequence analysis of simian transforming growth 

factor-beta cDNA 

DNA 6 (3) , 239-244 (1987) 

87246074 

3474130 

Original source text: African green monkey cells (cell line 
BSC-40) , cDNA to mRNA, clone pTGF-beta-2. 
Location/Qualifiers 
1. .1561 

/organism= " Cercopithecus aethiops " 
/mol_type="mRNA" 
/db__xref ="taxon: 953 4" 
262. .1434 

/note=" transforming growth factor-beta precursor" 
/codon_start=l 
/protein_id="AAA353 69 . 1" 



/db_xref="GI : 176553" 

/ trans la t ion= "MPPSGLRLLPLLLPLLWLLVTiTPSRPAAGLSTCKTIDMELVKRK 
RIETIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNEIYDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSNSPEWLSFDVTGWRQWLSRGGEIEGFR 
LSAHCSCDSKDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS " 
sig_peptide 262. .1095 

/note=" transforming growth factor-beta signal peptide" 
mat_peptide 1096. .1431 

/product=" transforming growth factor-beta" 

BASE COUNT 301 a 547 c 446 g 267 t 

ORIGIN 

Query Match 71.6%; Score 974.2; DB 9; Length 1561; 

Best Local Similarity 85.0%; Pred. No. 6.8e-167 ; 

Matches 1150; Conservative 0; Mismatches 143; Indels 60; Gaps 3; 

Qy 6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 

II Ml II I I II lllllllllll lllllllllllll IIIIIIIIIIIMI I 

Db 257 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 316 

Qy 66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

I MM lllllll lllllllllllll Mill IMIIIIIIMIII 1 1 1 1 1 1 1 1 1 1 

Db 317 TGGTGCTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATGG 3 76 

Qy 126 AG CTGGTGAAG CGGAAG CG CAT CGAGG C CATTCG CGG C CAGATTCTGT C CAAG CTT CGG C 18 5 

MMMMMIMMMMIMIMI MM lllllllllll lllllllllll MM 

Db 3 77 AGCTGGTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 43 6 

Qy 186 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 245 

I 1 1 1 1 i I M 1 1 1 1 1 1 1 1 1 MM I IIIIIIIIMIIMIIIMI Mill II II I 

Db 437 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 4 96 

Qy 24 6 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II MM I i 1 1 1 ! 1 1 1 M 1 1 1 IIIIMM MM II lllllllllll I 

Db 4 97 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAAC 556 

Qy 3 06 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

I Mill I I I I I I I I I I I I I I I I I I I I I I I I I I I I J I I I I I I I [ I I J I I I I MM 
Db 557 CGGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 616 

Qy 366 AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 5 

MMIIIMI II MINI I I I MUM IIIIMM I IIIIMM M I 

Db 617 AAATCTATGACAAGTTCAAGCAGAGC^ 676 

Qy 426 AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 48 5 

lllllll Mill M M IIIIMM IIIIMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 lllllll 

Db 677 AGCTCCGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 73 6 

Qy 4 8 6 GGCTC^GTTAAAAGTGGAGC^GC^CGTGGAGCTATACC^GAAATACAGCAATGATTCCT 54 5 

I I I M M M 1 1 1 1 1 M 1 M M M 1 1 1 1 1 1 1 1 1 1 I f I M 1 1 1 1 1 1 1 1 1 1 M MM 

Db 737 GGCTCAAGTTAAAAGTGGAGCAGCATGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 7 96 

Qy 54 6 GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 605 

MM ] 1 1 f i 1 1 1 1 1 ! 1 1 E 1 1 1 i I J [ ! I Mill MM MM M MM Mil 



Db 7 97 GGCGATACCTCAGCAACCGGCTGCTGGCGCCCAGCAACTCGCCGGAGTGGTTGTCTTTTG 856 

Qy 606 ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

IIMIMMIMIMIMIIIIIMI 1 1 1 Mil Ml M M Mill lllllll 

Db 8 57 ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCC 916 

Qy 666 TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 72 5 

I II llllllll IIIMIIIIIIIIIIIIIIIIMIM II Mill II lllllll 

Db 917 TTAGCGCCCACTGCTCCTGTGACAGCAAAGATAACACACTGCAAGTGGACATCAACGGGT 976 

Qy 72 6 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 785 

III I I llllllll IMIMMMMM lllll llllllllllllll lllllll 
Db 977 TCAGTACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGC 1036 

Qy 78 6 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I MMMMMIMI IMIMMMMM II Mill MMMMMMMMM 

Db 1037 TTCTCATGGCCACCCC^CTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGA- 1095 

Qy 84 6 CCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATA 905 

llllllll I 

Db 1096 GCCCTGGACA 1105 

Qy 906 CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 965 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II lllllll 
Db 1106 CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTATATTGACT 1165 

Qy 966 T C CGG AAGG A C CTGGG CTGGAAGTGGATT CATG AA C C CAAGGG CTA C CATG C CAATTT CT 1025 

MM MMMM MMMMMIMI M II MIMIMMMMMMM MM 

Db 1166 TCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 1225 

Qy 1026 GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 1085 

1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 llllllll lllll 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 I 

Db 1226 GCCTGGGGCCCTGTCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCC 1285 

Qy 108 6 TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 114 5 

MINI . 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 ! 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 

Db 12 86 TGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 1345 

Qy 1146 AGCC^CTGCCCATCGTGTACTACGTGGGCCGCAAGCCC^GGTGGAGCAGCTGTCCAACA 1205 

1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 r i j 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 f 1 1 1 1 1 

Db 134 6 AGCC^CTGCCC^TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCA.GCTGTCC^CA 14 05 

Qy 12 06 TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCC 1265 

llllllllll llllllll I I I I I I I I I ] I M I I I I I t I I I I I II I II I 

Db 14 06 TGATCGTGCGCTCCTGC^AATGCAGCTGAGGCCCCGCCCCGCCCCGCCCCACCCCGGCAG 14 65 

Qy 1266 GGCAGGCCCGGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA 1317 

I I Mill llllllllll III II I MMMMMMMMM! 

Db 14 66 GCCCGGCCCCGCCCCACCCCACCCCCGCTGTCTTGCCCTTGGGGGCTGTATTTAAGGACA 1525 

Qy 1318 -TCGTGCCCCAAGCCCACTTGGGATCGATTAAA 134 9 

1 1 1 1 1 1 1 [] 1 1 1 1 1 1 1 MM I I II 

Db 1526 CCCGTGCCCCAAGCCCACCTGGGGCCCCATTAA 1558 



RESULT 
E03028 
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LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 

COMMENT 



E03028 1821 bp RNA linear PAT 29-SEP-1997 

DNA encoding human prepro TGF-betal. 

E03028 

E03028 . 1 GI :2171250 
JP 1991180192-A/l . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata,- Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1821) 

Ohashi,H., Ishii,Y., Miyata,Y., Miyazono,K., Miyagawa,K. and 
Takaku, F. 

PRODUCTION OF HUMAN PR0-TGF-BETA1 BY GENETIC RECOMBINATION 

Patent: JP 1991180192-A 1 06-AUG-1991; 

KIRIN BREWERY CO LTD 

OS Homo sapiens (human) 

PN JP 1991180192-A/l 

PD 06-AUG-1991 

PF 07-DEC-1989 JP 1989318243 

PI OHASHI HIDEYA, ISHII YASUYUKI , MIYATA YOSHINORI , PI MIYAZONO 
KOHEI , 

PI MIYAGAWA KIYOSHI, TAKAKU FUMIMARO 

C12P21/00,C07K13/00,C12N5/10,C12N15/l8//C12Ql/68, (C12P21/00, 
C12R1:91) , 

(C12N15/18 / C12R1: 91) ; 
strandedness : Double; 
topology: Linear; 
hypothetical: No; 
anti-sense: No; 

*source : t issue_type=placenta ; 
* source : clone=pVC19-TGF; 



PC 
PC 
PC 

cc 
cc 
cc 
cc 
cc 
cc 

FH 
FH 
FT 
FT 



Key 

5 'UTR 
CDS 



FT 

sig_peptide 511. 
FT mat_peptide 
FT 

FT mat_peptide 
FT 
FT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



3 'UTR 

Location/Qualifiers 
1. .1821 

/organism="Homo sapiens" 
/mol_type="genomic RNA" 
/db_xref = M taxon: 9606" 
326 a 679 c 508 g 308 t 



Location/ Qual i f iers 

1. .510 
511. .1683 

/product = 'human prepro TGF-betal 1 
.597 

598. .1680 

/note= 'human pro TGF-betal 1 
1345. .1680 

/product = 'human TGF-betal' 
1684 . . 1820. 



FT 



Query Match 71.4%; 
Best Local Similarity 85,0%; 
Matches 1135; Conservative 



Score 971.4; DB 6 
Pred. No. 2.1e-166 
0; Mismatches 141 



Length 1821; 
Indels 59; Gaps 



Qy 



Db 



6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 5 

M Ml I M I M lllllllllll lllilllllllll llllllllllllll I 
507 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 566 



Qy 


66 


TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 

1 llllllllllllllll MINIMI II M 1 1 ! 1 1 1 j 1 [ J 1 1 1 ] 1 Mllllllll 

TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 


125 


Db 


567 


626 


Qy 


126 


AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 

1 1 ! 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 F 1 1 1 1 IIIIIIIMII lllllllllll MM 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 


185 


Db 


627 


686 


Qy 


186 


TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 II 1 1 I 1 I Mill M II 1 
TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 


245 


Db 


687 


746 


Qy 


246 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAAGCGGAGCCCGAGC 

1 M llllllll 1 1 1 1 1 M 1 II 1 1 1 1 llllllll MM llllllllllllllll 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 


305 


Db 


747 


806 


Qy 


306 


CAGAGG CGGA CTA CTA CG CCAAGG AGGT CA C CCG CGTGCTAATGGTGGAAAG CGG CAA C C 

1 Mill 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 J 1 1 1 1 i J 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCGGCGTGCTAATGGTGGAAACCCACAACG 


365 


Db 


807 


866 


Qy 


366 


AAATCTATGATAAATTCAAGGGCACGCCCCACAGCTTATATATGCTGTTCAACACGTCGG 

MINIUM II MMM 1 1 M II II llllllll 1 MINIM II 1 

AAATCTATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAG 


425 


Db 


867 


926 


Qy 


426 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 

IIIIIII llllllll II Mill II llllllll llllllllllllll IIIIIII 
AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 


485 


Db 


927 


986 


Qy 


486 


GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 

IIIIIIIIIIIMIIIIIIIIIIIMIIIMIII IIIIIIIIIIIIIMM MINI 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 


545 


Db 


987 


1046 


Qy 


546 


GGCGCTACCTCAGCAACCGGGTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 

MM 1 1 1 1 1 < 1 ! 1 1 1 1 1 : 1 1 1 1 ! 1 1 1 Mill Mill II MMM 1 M MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 


605 


Db 


1047 


1106 


Qy 


606 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 

1 1 1 1 1 1 1 1 1 F 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 Ml III Ml II M Mill IIIIIII 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 


665 


Db 


1107 


1166 


Qy 


666 


TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 

1 II MINIM 1 1 1 1 1 1 1 M 1 1 1 1 MM 'ML II INN II IIIIIII 

TTAGGGCCCACTGCTCCTGTGAC^GCAGGGATAACACACTGC^GTGGACATCAA.CGGGT 


725 


Db 


1167 


1226 


Qy 


726 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 
Ml 1 1 llllllll 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 llllllllllllll IIIIIII 


785 


Db 


1227 


III 1 1 llllllll 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II II IIIIIII 
TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 


1286 


Qy 
Db 


786 
1287 


TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

1 II 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 INN MINNNNNNNI 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 


845 
1345 


Qy 


846 


CCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATA 


905 


Db 


1346 


llllllll 1 


1355 



Qy 9 06 CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 965 

Mill 1 1 II II 1 1 1 1 1 M 1 1 1 1 1 1 1 1: 1 1 1 1 1 1 1 1 1 1 1 1 1; 1 1 1 llllllllll 

Db 13 56 C CAACTATTG CTT CAG CT CCA CGGAGAAGAA CTG CTG CGTGCGG CAG CTGTACATTGACT 1415 

Qy 966 TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCT 1025 

I I I I Mllllll IIIIMIIIIIIII II II I MM II Ml I Ml II II I I I | I 
Db 1416 TCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 14 75 

Qy 1026 GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 108 5 

Mil II II II M MINIM MMMM Mill 1 1 M M M 1 1 1 1 M M M 1 1 I 

Db 147 6 GCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCC 1535 

Qy 1086 TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 114 5 

llllll Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1536 TGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 1595 

Qy 114 6 AGCC^CTGCCC^TCGTGTACTACGTGGGCCG(^GCCC^GGTGGAGC^GCTGTC(^a\ 12 05 

MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1596 AGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 1655 

Qy 12 06 TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCC 1265 

IMIMIMI : l I II 1 1 1 1 1 1 II I Ml Ml MMMMMM Mill I 

Db 1656 TGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAG 1715 
Qy 1266 GGCAGGCCCGGCCCCACCCCCGCCCGCCT CACCGGGG CTGTATTTAAGGACA 1317 

I I Mill MM MM III II II I [[ 1 1 1 1 1 1 J 1 1 1 J I j 1 1 1 

Db 1716 GCCCGGCCCCACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACA 1775 

Qy 1318 TCGTGCCCCAAGCCC 1332 

Mllllll I II 
Db 1776 CCGTGCCCCCAAGCC 17 90 



RESULT 14 

BC022242 

LOCUS 

DEFINITION 
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KEYWORDS 
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ORGANISM 
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TITLE 
JOURNAL 



REMARK 
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PRI 04-FEB-2002 
complete cds . 



Craniata; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



BC022242 1746 bp mRNA linear 

Homo sapiens, clone MGC: 22008 IMAGE : 4399762 , mRNA, 
BC022242 

BC022242 .1 GI : 18490115 
MGC. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates ; 
1 (bases 1 to 1746) 
Strausberg, R . 
Direct Submission 

Submitted ( 01-FEB-2002 ) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

NIH-MGC Project URL: http://mgc.nci.nih.gov 
Contact : MGC help desk 
Email : cgapbs -r@mail . nih .gov 
Tissue Procurement: ATCC 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 



DNA Sequencing by: Sequencing Group at the Stanford Human Genome 
Center, Stanford University School of Medicine, Stanford, CA 94305 
Web site: http://www-shgc.stanford.edu 
Contact: (Dickson, Mark) mcd@paxil.stanford.edu 

Dickson, M., Schmutz, J . , Grimwood, J., Rodriquez, A., and Myers, 
R. M. 

Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consort ium/LLNL at: http://image.llnl.gov 
Series: IRAK Plate: 27 Row: e Column: 21 

This clone was selected for full length sequencing because it 
passed the following selection criteria: Hexamer frequency ORF 
analysis . 

FEATURES Location/Qualifiers 
source 1. .1746 

/organism^ " Homo sapiens " 

/mol_type="mRNA" 

/db_xref="taxon: 9606" 

/clone= "MGC: 22008 IMAGE : 43 9 9762 " 

/ 1 i s sue_type= " Duodenum , adenoca rc inoma " 

/clone_lib="NIH_MGC_8 8" 

/lab_host="DH10B" 

/note=" Vector: pCMV-SP0RT6" 
CDS 370. .1542 

/codon_start=l 

/product =" Unknown (protein for MGC:22008) n 
/protein_id="AAH22242 .1" 
/db_xref="GI : 18490116" 

/ trans la t ion = "MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRK 
RIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNEIYDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFR 
LSAHCSCDSRDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVXALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS " 

BASE COUNT 376 a 612 c 472 g 286 t 

ORIGIN 

Query Match 71.3%; Score 971; DB 9; Length 1746; 

Best Local Similarity 84.8%; Pred . No. 2.5e-166; 

Matches 1148; Conservative 0; Mismatches 145; Indels 60; Gaps 3; 
Qy 6 CCGAGATGG CG CCTTCGGGG CTG CGG CTCTTG CCG CTG CTG CTG CCG CTG CTGTGG CTG C 65 

II III 1 1 1 1 1 1 MINIUM Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 

Db 3 65 CCCCCATGCCGCCCTCCGGG CTG CGG CTG CTG CTG CTG CTGCTACCG CTG CTGTGG CTAC 4 24 

Qy 66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

I I I I I I I I I I I I I I I I I I I I I I I I I I I Mill I I I I I I I I I I I I II MINI I I I I 
Db 4 25 TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 4 84 

Qy 126 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 

1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MIMMIMI I II I 

Db 485 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 544 

Qy 186 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I M 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 MMI II II I 

Db 54 5 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 604 



Qy 


246 


Db 


605 


Qy 


306 


Db 


665 


Qy 


366 


Db 


725 


Qy 


426 


Db 


785 


Qy 


486 


Db 


845 


Qy 


546 


Db 


905 


Qy 


606 


Db 


965 


Qy 


666 


Db 


1025 


Qy 


726 


Db 


1085 


Qy 


786 


Db 


1145 


Qy 


846 


Db 


1204 


Qy 


906 


Db 


1214 


Qy 


966 


Db 


1274 


Qy 


1026 


Db 


1334 



CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II llllllll llllllllllllll llllllll MM I I I I I 1 I I I I I I I I I I 
CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 664 

CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

I Mill 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 724 
AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 5 

1 1 1 1 1 1 1 1 1 1 II MMM I I MMM IIIIMII I IIIMII II I 

AAATCTATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAG 78 4 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 5 

lllllll llllllll II lllll II MMMM llllllllllllll lllllll 
AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 844 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIIIIIIIIIIIIIIII MMM 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 9 04 
GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 605 

MM 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 i I f 1 1 1 1 1 Mill lllll II MMM I II MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 964 
ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

. i Ml Ml Ml II M lllll M 1 1 1 1 1 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 1024 
TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 725 

I II MMMM MINIMUM! MIMIIMM II Mill II lllllll 

TTAGGGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 1084 
TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 785 

Ml I I llllllll llllllllllllllllllll IMIMIMIMM IIIMII 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 114 4 

TCCTCATGGGCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I I I I I I I I I II I I I II II I I II I II I I I II II I lllll llllllllllllllllll 
TTCTCATGGCCAGCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCGCGGCACCGCCGA- 12 03 



MMMM I 

-GCCCTGGACA 1213 



CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 965 

lllllll MIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIMIII llllllllll 

CC^CTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACT 12 73 
TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCT 1025 

MM MMMM Mill I || II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II I II MM 

TCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 1333 
GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 108 5 

MM MMMM MMMM llllllll lllll II I II II II 1 1 1 1 1 1 II I II I 

GCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCC 13 93 



Qy 


1086 


Db 


1394 


Qy 


1146 


Db 


1454 


Qy 


1206 


UD 


1514 


Qy 


1266 


Db 


1574 


Qy 


1318 


Db 


1634 



TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 114 5 

IMIIIIIIIIII MM MUM 1 1 1 1 1 1 M 1 1 1 M I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 14 53 
AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 12 05 

MM MMMMMMMMMMMMMMMMMMMMMMMMMMMI 

AGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 1513 
TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCC 1265 

MMIMMI MMMMMMMMMM MMMMMM M I M I 

TGATCGTGCGCTCCTGCAAGTGCAGGTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAG 1573 
GGCAGGCCCGGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA 1317 

I I MMI MM MM Ml M M MMMMMMMMMI 

GCCCGGCCCCACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACA 1633 



MMMMMMM MM 
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106216 
Sequence 
106216 
106216 . 1 



2 from Patent 



GI :590649 



1560 bp 
EP 0293785. 



DNA 



linear 



PAT 02-DEC-1994 



Unknown . 

Unknown . 

Unclassified. 

1 (bases 1 to 1560) 

Purchio, A. F. , Gentry, L. and Twardzik,D. 

Cloning and expression of simian transforming growth factor-SSI 
Patent: EP 0293785-A2 2 07-DEC-1988; 
Locat ion/Qua 1 if iers 



301 



1. .1560 

/ organ i sm= " unknown " 
a 547 c 442 g 



267 t 



3 others 



Query Match 71.3%; 
Best Local Similarity 84.9%; 
Matches 1145; Conservative 



Score 969.8; DB 6 
Pred. No. 4.3e-166 
0; Mismatches 143 



Length 1560; 
Indels 60 ; Gaps 



3; 



Qy 

Db 



1 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 7 0 

Ml MM M MMMMMI MMMMMMI MMMMMMM M Ml 

2 61 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 32 0 



Qy 

Db 

Qy 
Db 



71 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 13 0 

MIMIIII 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill MMMMMMM MINIM MINI 

321 CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATCGAGCTG 380 
131 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 190 

I III MM 1 1 1 1 lllllllllll MIMIIII Mill Ml 

381 GTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 44 0 



Qy 


191 


AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 

MIIIIMIIII IIIIMI IIIIIIIIIMIIIIIilll lllll II II II II 

AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 


250 


Db 


441 


500 


Qy 


251 


TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 

IIIIIIII llllllllllllll IIIIIIII MM II Illllllllll II III 
TA CAA CAG CA CCCG CGA C CGGGTGG CCGGGG AGAGTG CGGAG C CGG AG C C CG AACCGGAG 


310 


Db 


501 


560 


Qy 


311 


GCGGA GTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 

M 1 1 1 1 1 1 1 1 ! II 1 II 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 IIIIII 1 1 1 1 MM Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 


370 


Db 


561 


620 


Qy 


371 


TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 

lllll II IIIIII 1 1 1 IIIIII IIIIIIII 1 IIIIIIII M IIIIII 

TATGAC^GTTCAAGCAGAGCACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 


430 


Db 


621 


680 


Qy 


431 


CGGG AAG CGGTG C CGGAA C CTGTATTG CTCT CTCGGG CAGAG CTG CG C CTG CTGAGG CTC 

II Mill II II IIIIIIII MIIMII llllllllllllll IIIIIIMIMI 

CGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 


490 


Db 


681 


740 


Qy 


491 


AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 

IIIIIHI ' IIIIIIII IIIIIIII IIIIIMIIIIIMIII IMIMIIII 

AAGTTAAAAGTCGAGCAGCATGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 


550 


Db 


741 


800 


Qy 


551 


TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 

1 II 1 1 1 1 1 1 II 1 M II 1 II 1 1 1 1 Mill MM lllllllll MM Mlllllll 

TACCTCAGCAACCGGCTGCTGGCGCCCAGCAACTCGCCGGAGTGGTTGTCTTTTGATGTC 


610 


Db 


801 


860 


Qy 


611 


ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 

I.I Ml MM III II II lllll IIIIIIII II 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 


670 


Db 


861 


920 


Qy 


671 


GCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 

MIIMII lllll 1 1 1 1 [ 1 1 1 1 1 [ lllll: II Mill II llllllllll 1 

GCCCACTGCTCCTGTGACAGCAAAGATAACACACTGCAAGTGGACATCAACGGGTTCACT 


730 


Db 


921 


980 


Qy 

Db 


731 
981 


TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 

1 MIIMII llllllllllllll Mill llllllllllllll IIIIIIII III 

ACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 


790 
1040 


Qy 


791 


ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 

IMIMMIMMMMMMMIM II lllll lllllllllllllllllll 

ATGGCCACCCCGCTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGAG 


850 


Db 


1041 


1095 


Qy 


851 


GATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATACCAAC 

IIMIII IIIIII 


910 


Db 


1096 


1 i nq 

1 i U j/ 


Qy 


911 


TACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGG 

1 1 1 1 mi iii 1 1 ii n i 1 1 ii 1 1 1 1 : 1 1 1 iiiiiiiiiiii ii iiiiMiiiii 

TACTGCTTCAGCTCCACGGAGAAGAACTGCTNCGTGCGGCAGCTGTATATTGACTTCCGC 


970 


Db 


1110 


1169 


Qy 


971 


AAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTG 

IIIIIIII MMIMIMIMI II II L II III lllll II lllllllll 

AAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTG 


1030 


Db 


1170 


1229 



Qy 1031 GGG CCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTAC 10 90 



■ 1 1 1 1 1 1 1 M 1 1 1 II I II Mill lllll IIIIMM IMIIIMM MINI 

Db 123 0 GGGCCCTGTCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTAC 128 9 

Qy 10 91 AACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCA 1150 

Lllll llllllilli . 1 1 1 1 1 II I M 1 1 1 1 1 1 1 1 1 II Ml 1 1 1 1 ! II I : I II I 

Db 12 9 0 AACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCA 134 9 

Qy 1151 CTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATC 1210 

II MM llllll III Ml IIMMIMI I III I I III MIMIIII 1 1 MM II II! 

Db 13 50 CTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATC 14 0 9 

Qy 1211 GTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAG 127 0 

lllll Mill I I IMMMMIM II IIIIMM Mill I III 

Db 1410 GTGCGCTCCTGAAAATGCAGCTGAGGCCCCGCCCCGCCCCGCCCCACCCCGGCAGGGCCG 1469 
Qy 1271 GCCCGGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA - TCGT 1321 

MM llllllilli III II I III llllll lllll Mill III 

Db 14 7 0 GCCCCGCCCCACCCCACGCCCGCTGTGTTGCCCTTGGGGGCTGTATTTAAGGACACCCGT 152 9 

Qy 13 22 GCCCCAAGCCCACTTGGGATCGATTAAA 134 9 

1 1 1 1 1 1 : 1 1 1 1 1 mm i i ii 

Db 1530 GCCCCAAGCCCACCTGGGGCCCCATTAA 1557 



Search completed: October 27, 2003, 18:35:20 
Job time : 5318.15 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 

OM nucleic - nucleic search, using sw model 

Run on: October 27, 2003, 10:47:27 ; Search time 408.887 Seconds 

(without alignments) 
8985.201 Million cell updates/sec 

Title: US-10-017-372E-3 8 

Perfect score: 1361 

Sequence: 1 tggtaccgagatggcgcctt cgattaaagcggccgcgact 1361 

Scoring table: I DENT I TY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 2552756 seqs, 1349719017 residues 

Total number of hits satisfying chosen parameters: 5105512 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 
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embl/NA1981 .DAT: * 
embl /NA19 82 . DAT : * 
embl/NA1983 .DAT: * 
embl/NA1984 .DAT: * 
embl /NA1 98 5 .DAT: * 
embl / NA1 986. DAT : * 
embl/NA1987 .DAT: * 
embl/NA1988 .DAT: * 
-embl/NA198 9 . DAT : * 
-embl /NA1 990 . DAT : * 
- embl / NA1 991. DAT : * 
-embl/NA1992 .DAT: * 
-embl /NA1 9 93 .DAT:* 
-embl /NA1 994 . DAT : * 
-embl/NA1995 .DAT: * 
-embl/NA1996 .DAT: * 
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-embl/NA1998 .DAT: * 
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- embl / NA2 00 0. DAT : * 

- embl /NA2 0 0 1A . DAT : * 

- embl /NA2 0 0 IB . DAT : * 
- embl / NA2 002. DAT : * 

- embl /NA2 003 .DAT: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
AAD22696 

ID AAD22696 standard; cDNA; 1326 BP. 
XX 

AC AAD22 696; 
XX 

DT 26-FEB-2002 (first entry) 
XX 

DE Porcine transforming growth factor beta 1 (TGF-betal) cDNA. 
XX 

KW Porcine; transforming growth factor beta 1; TGF-betal; gene therapy; 
KW IBD; inflammatory bowel disease; autoimmune disease; immunosuppressive; 
KW multiple sclerosis; rheumatoid arthritis; systemic lupus erythematosus; 
KW diabetes mellitus; sarcoidosis; psoriasis; dermatological ; ss . 
XX 

OS Sus scrofa. 
XX 

FH Key Location/Qualifiers 
FT CDS 16.. 1188 

FT /*tag= a 

FT /product = "Porcine TGF-betal mutant protein" 

XX 

PN WO200181404-A2 . 
XX 

PD 01-NOV-2001. 
XX 

PF 20-APR-2001; 2 0 0 1W0-US12 9 8 0 . 
XX 

PR 20-APR-2000; 2 000US- 199014 P . 
XX 



PA (USSH ) US DEPT HEALTH & HUMAN SERVICES . 
XX 

PI Strober W, Nakamura K, Kitani A, Fuss IJ; 
XX 

DR WPI; 2002-026155/03. 

DR P-PSDB; AAE13596. 
XX 

PT Composition for treating autoimmune diseases e.g. inflammatory bowel 

PT disease in humans, comprises vector containing transforming growth 

PT factor-beta under the control of inducible promoter 
XX 

PS Claim 1; Fig 1; 78pp; English. 
XX 

CC The invention relates to a composition containing a vector comprising a 

CC gene encoding a regulatory transcription factor under the control of a 

CC promoter encoding a transforming growth factor-beta (TGF-beta) . The 

CC vector is useful for expressing TGF-beta, such as TGF-betal, TGF-beta2 

CC or TGF-beta3, its variants or homologues, by transfecting a cell which 

CC is part of a host suspected of having an autoimmune disease, especially 

CC inflammatory bowel disease (IBD) , under conditions such that the 

CC polypeptide encoded by the nucleic acid sequence in the vector is 

CC expressed. The vector is delivered using a delivery system. The delivery 

CC of the vector results in substantial elimination of symptoms of the 

CC autoimmune disease and increased production of IL-10 by the host. The 

CC composition is useful for treating various diseases with an autoimmune 

CC component such as multiple sclerosis, rheumatoid arthritis, systemic 

CC lupus erythematosus, insulin-dependent diabetes mellitus, sarcoidosis 

CC and psoriasis, and also for assaying the expression of a gene in a cell. 

CC The vector is further useful for screening of the effect of test 

CC compounds on cytokine (e.g. TGF-beta) expression of transfected cells. 

CC The present sequence is a cDNA encoding porcine TGF-betal mutant. 

XX 

SQ Sequence 1326 BP; 263 A; 438 C; 392 G; 233 T; 0 other; 

Query Match 88.8%; Score 1208.6; DB 24; Length 1326; 

Best Local Similarity 95.4%; Pred. No. 4.4e-235; 

Matches 1297; Conservative 0; Mismatches 9; Indels 53; Gaps 3; 
Qy 1 TGGTACCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTG 60 

1 1 M 1 1 1 1 1 1 1 1 II I 1 1! 1 1 1 Ml 1 1 1 II I ! 1 1 II 'I M 1 1 1 1 1 1 II 1 1 1 INI 1 1 I 

Db 6 TGGTACCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTG 65 

Qy 61 GCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGA 12 0 

II MM II III II lllllll III IIMI Mill I Mill II! IMIII: llllll III 

Db 66 GCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGA 12 5 

Qy 121 (^TGGAGCTGGTGAAGCGGAAGCGC^TCGAGGCC^TTCGCGGCC^GATTCTGTCCAAGCT 18 0 

MMMMMMMMMMMMMMMMMMIMMMMMMMMMMMI 

Db 126 CATGGAG CTGGTGAAG CGGAAG CG CAT CG AGG CCATT CG CGGC CAGATT CTGTC CAAG CT 18 5 

Qy 181 TCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGT 24 0 

MMM MMMMMMMMMMMMIMMMMMMMMMMMMI 1 1 

Db 186 TCGGCTCGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGT 24 5 

Qy 241 ACTGGCTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCC 3 00 

M I! MM I M MM MM 1 1 1 M I III 1 1 II 1 1 1 1 II II I II 1 1 1 II II I M II I M III 

Db 24 6 ACTGGCTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCC 3 05 



Qy 


301 


Db 


306 


Qy 


361 


Db 


366 


Qy 


421 


Db 


426 


Qy 


481 


Db 


486 


Qy 


541 


Db 


546 


Qy 


601 


Db 


606 


Qy 


661 


Db 


666 


Qy 


721 


Db 


726 


Qy 


781 


Db 


786 


Qy 


841 


Db 


846 


Qy 


901 


Db 


855 


Qy 


961 


Db 


915 


Qy 


1021 


Db 


975 


Qy 


1081 


Db 


1035 



I II M I II I II M I I II II I M I II Ml II II I II I II I I II M I M I Ml I M I I I I 



GAACCAAATCTATGATAAATTCAAGGGCACCCCCCACAGCT^^ 4 2 0 

I M i I III IM II 1 1 II II M I II II I MM M I II II II III M I M I Ml I 1 1 M 1 1 

CAACCAAATCTATGATAAATT 42 5 

GTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCT 4 8 0 

1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 ! M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1! 1 , 1 1 1 1 1 I 

GTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCT 4 8 5 
GCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGA 54 0 

M 1 1 1 1 1 . 1 1 ! 1 1 1 1 1 1 1 1 M 1 1 1 1 1 i 1 1 i I i ! 1 1 ! i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! I M 1 1 1 

GCTGAGGCT CAAGTTAAAAGTGGAG CAG CA CGTGGAG CTATAC CAGAAATACAG CAATGA 54 5 
TTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTC 600 

MIMM Mill MM MIMIIIIIIII Ml MMMIII lllllll Mill II III I 

TTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTC 6 05 
CTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTT 660 

I II M I II I II 1 1 M 1 1 M I II II I II III M 1 1 1 1 1 1 1 1 M 1 1 1 II II II I II M II 1 1 

CTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTT 665 
TCGCCTCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAA 720 

IMMMIMMIMM MMI 1 1 1 1 1 1 M I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAA 725 
CGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTT 78 0 

I MMI II 1 1 II I II 1 1 M 1 1 M II II III 1 1 1 1 1 1 II I II II 1 1 III III II II II 1 1 

CGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTT 78 5 
CCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCG 84 0 

I II 1 1 1 1 1 1 II 1 1 M I M 1 1 II II I II III M 1 1 1 1 1 II I 1 1 III III II I II 1 1 M 1 1 

CCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCG 84 5 



III 



GGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACAT 960 

1 1 II M I II II 1 1 1 II II II 1 1 II 1 1 I II 1 1 1 1 M I MM I M I M I II M M 1 1 M I 

GGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACAT 914 

TGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAA 1020 

I II II Ml I II I II 1 1 1 1 II II I II III M 1 1 1 1 1 1 1 1 II I Ml III II I II 1 1 M 1 1 

TGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAA 974 

TTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCT 108 0 

II II I II I II 1 1 II I II II II I MM II II 1 1 1 1 1 M MM I M 1 1 M III 1 1 1 1 1 M I 

TTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCT 1034 

GGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGC 114 0 

: 1 1 1 M I M I II Ml I 1 1 1 1 ! Ill 1 1 1 1 1 1 1 1 1 

GGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGC 10 94 



Qy 1141 GCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTC 12 0 0 

1 1 ' 1 1 II I ; 1 1 II I II IM I i 1 1 1 1 1 1 1 1 1 1 1 1 1 i II , III 1 1 1 1 II 1 1 1 h I II 1 1 1 

Db 1095 GCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTC 1154 

Qy 12 01 O^CATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCC 1260 

I I I I I I I I ! I I I I I I I I I I I I I I I I M I I I I I lllllllllllllllllllllllll 

Db 1155 CAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA-GCCCCGCCCCGCCCACAGCCCCGCC 1213 

Qy 12 61 CACCCGGCAGGCCCGGCCCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCG 132 0 

I I I I II I II I Ml II 1:1 I I I I I II I I I I I I I I I II I III I I I I II H M I II I I 
Db 1214 CACCCGGCAGGCCCGGCCCCZACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCG 1273 

Qy 1321 TGCCCCAAGCCCACTTGGGATCGATTAAAGCGGCCGCGA 1359 

1 1 1 1 1 1 1 1 M 1 1 1 1 lllllllllllllll I I II 

Db 12 74 TGCCCCAAGCCCAC - TGGGATCGATTAAAGGTGGAGAGA 1311 



RESULT 2 
ABQ76674 

ID ABQ76674 Standard; DNA; 2527 BP. 
XX 

AC ABQ76674; 
XX 

DT 26-MAR-2003 (first entry) 
XX 

DE Androgen receptor signalling pathway-associated DNA E00973 . 
XX 

KW Androgen receptor; transactivation; modulator; Smad3 ; Smad4 ; Akt; TGF-B; 
KW signal transduction pathway; transforming growth factor-B; phosphatase; 
KW tensin; cytostatic; antiproliferative; cellular proliferation; cancer; 
KW E00973; ds . 
XX 

OS Synthetic. 
XX 

PN WO200282081-A2 . 
XX 

PD 17-OCT-2002. 
XX 

PF 05-APR-2002; 2002WO-US11086 . 
XX 

PR 06-APR-2001; 2001US-282266P. 
PR 13-MAR-2002; 2 0 02US -365 060P . 
XX 

PA (UYRP ) UNIV ROCHESTER. 
XX 

PI Chang C; 
XX 

DR WPI; 2003-046871/04. 
XX 

PT Modulating androgen receptor activity, by administering a compound that 

PT modulates receptor activity, inhibits receptor-signal transduction 

PT pathway/receptor-coactivator interaction or changes amount or receptor 

PT 

XX 

PS Disclosure; Page 225-226; 302pp; English. 
XX 

CC This invention describes a novel method for modulating androgen receptor 



CC activity or androgen receptor-mediated transactivation activity in a 

CC cell. The method involves administering a compound which causes 

CC modulation of the androgen receptors activity and the inhibition of 

CC interaction between the receptor and a protein involved in a signal 

CC transduction pathway. The compound also inhibits the interaction between 

CC the androgen receptor and a protein selected from Smad3 , Smad4 , Akt, 

CC transforming growth factor (TGF) -B and phosphatase and tensin homologues 

CC deleted on chromosome 10 (PTEN) or their fragments. The compounds of the 

CC invention have cytostatic and antiproliferative activity. The obtained 

CC composition is useful for treating any disease, where uncontrolled 

CC proliferation or cellular proliferation occurs such as cancer, e.g. 

CC prostate cancer. This sequence represents the androgen receptor 

CC transactivation signalling pathway modulator E00973 described in 

CC the method of the invention. 

XX 

SQ Sequence 2527 BP; 472 A; 888 C; 735 G; 432 T; 0 others- 
Query Match 72.2%; Score 982.8; DB 25; Length 2527; 
Best Local Similarity 85.1%; Pred. No. 2e-189; 

Matches 1144; Conservative 0; Mismatches 147; Indels 53; Gaps 2; 
Qy 6 CCGAGATGG CG C CTTCGGGG CTG CGG CT CTTG CCG CTG CTG CTG C CGCTGCTGTGG CTG C 65 

II III I 1 1 1 II lllllllllll lllllllllllll INI III MM II! I 

Db 837 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 

Qy 66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

I II III III III III II MINIMI Mill IIIIIIIIIIMM MINIUM 

Db 8 97 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 9 56 

Qy 126 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 185 

MM II III Ml III M MM MMlll II lllllllllll lllllllllll 1 1 1 1 

Db 957 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 1016 

Qy 186 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I Mill Ml Ml MIMI III III Ml MUM MM MM II I Mill II II I 

Db 1017 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 

Qy 24 6 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II MINIM lllllllllll IIIIMM Mil 1 1 1 1 1 1 I i 1 1 i 1 1 1 

Db 1077 CCCTCTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 1136 

Qy 3 06 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 3 65 

I INN N Ml INN I III MMlll NUN I MM INI INN II I Nil 

Db 1137 CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 

Qy 366 AAATCTATGATAAATTC^AGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 5 

MINIM II MINI I I MUM IIIIMM I IIIIMM II I 

Db 1197 AAATCTATGACAAGTTCAAGCAGAGTACACAC^ 12 56 

Qy 426 AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 85 

MMlll NNI II NIM II IIIIMM 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 MMlll 

Db 12 57 AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 

Qy 4 86 GGCTC^GTTAAAAGTGGAGC^GCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 II 1 1 1 1 MINI 

Db 1317 GGCT(^GTTAAAAGTGGAGCAGC^CGTGGAGCTGTACC^GAAATA(^GaU\CAATTCCT 13 76 



Qy 54 6 GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 605 

MM MMMMMMMMMMM! MMI MMI II IMMI I II 1 1 1 1 

Db 1377 GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 1436 

Qy 6 06 ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 III III III II II Mill lllllll 

Db 14 37 ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 14 96 

Qy 666 TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 725 

I II llllllll lllllllllllll lllllllllll II Mill II lllllll 
Db 14 97 TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 1556 

Qy 72 6 TCAATTCTGGCCGCCGGGGTGACGTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 785 

III I I MMMM MMMMMMMMMM 1 1 II 1 1 1 1 1 1 1 1 1 1 lllllll 

Db 1557 TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1616 

Qy 786 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I MIIMIIIIIIIIIIIIMIIIIIMIIIII Mill I 

Db 1617 TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 1675 

Qy 84 6 CCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATA 905 

MMMM I 

Db 1676 GCCCTGGACA 168 5 

Qy 906 CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 965 

lllllll MIMMIMIMMIIIMMIMMMIIIIMIIMII Illlllllll 
Db 1686 CCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACT 1745 

Qy 966 TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCT 102 5 

MM llllllll .1 1 1 1 1 .11 1 1 1 1 ! II II IIIMIIIIIIIIIMMII MM 

Db 174 6 TCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 18 05 

Qy 102 6 GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 1085 

MM MMMM MMMM MMMM Mill 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 II I 

Db 18 06 GCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCC 1865 

Qy 108 6 TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 114 5 

I I II I I II M II I II I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I II I II I I I I I I 
Db 18 66 TGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 192 5 

Qy 114 6 AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 1205 

MM MIMMIMMIIMIIIIIMIIIIIMIIMMMIMI IMMIIMIM 

Db 1926 AGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGGAGCTGTCCAACA 198 5 

Qy 12 06 TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCC 1265 

MIIIIMM MMMMMMMMMM lllllllllll I lilllllll II 

Db 198 6 TGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCC- -CCGCCCCGCCCGGCC 2 043 

Qy 12 66 GGCAGGCCCGGCCCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCC 1325 

I I I II II I III I II IIMMIIMIIIIIIIII lllllll 
Db 2 044 CCACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCC 2103 

Qy 13 2 6 CAAGCCCACTTGGGATCGATTAAA 13 4 9 

lilllllll MM I I II 

Db 2104 CAAGCCCACCTGGGGCCCCATTAA 2127 



RESULT 3 
AAN60972 

ID AAN60972 standard; cDNA; 2537 BP. 
XX 

AC AAN60972; 
XX 

DT 31-OCT-2002 (updated) 

DT 28-OCT-1991 (first entry) 

XX 

DE Sequence encoding preTGF-beta. 
XX 

KW Transforming growth factor beta; cancer; wound healing. 
XX 

OS Unidentified. 
XX 

FH Key Location/Qualifiers 
FT misc_structure 37.. 113 
FT /*tag= a 

FT /note= "Sequence can form stable hairpin loops" 

FT CDS 842.. 2014 

FT /*tag= b 

FT mat_peptide 1676.. 2011 

FT /*tag= c 

XX 

PN EP200341-A. 
XX 

PD 10-DEC-1986. 
XX 

PF 21-MAR-1986; 86EP- 03 02 1 12 . 
XX 

PR 22-MAR-1985; 85US- 0715142 . 
PR 13-MAR-1987; 87US- 0025423 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA; 
XX 

DR WPI; 1986-326875/50. 
DR P-PSDB; AAP61468 . 
XX 

PT TGF-beta prodn. from transformed hosts - useful esp. for treating 

PT wounds <J6 2/9/86) . 

XX 

PS Disclosure; Fig lb; 26pp; English. 
XX 

CC The gene product is known to stimulate cell proliferation and 
CC inhibit anchorage -dependent growth of a variety of human cancer cell 
CC lines, it is esp. useful in treatment of burns and the promotion of 
CC surface and internal wound healing. TGF-beta may be expressed from a 
CC transformed CHO cell line. 

CC (Updated on 31-OCT-2002 to add missing OS field.) 
XX 

SQ Sequence 2537 BP; 473 A; 893 C; 739 G; 432 T; 0 other; 



Query Match 72.2%; Score 982; DB 7; Length 2 537; 

Best Local Similarity 84.9%; Pred . No. 3e-189; 

Matches 1148; Conservative 0; Mismatches 145; Indels 59; Gaps 2; 



Qy 

Db 


6 

837 


CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 

II III MM M MINI 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 


65 
8 96 


Qy 


66 


TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 

1 Mllllllllllllll 1 1 1 1 1 1 1 1 1 Mill Ml. II M Ml IIIIIIIIII 

TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 


12, 5 


Db 


897 


956 


Qy 


126 


AGCTGGTGAAGCGGAAGCGCATGGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 

IIIMM IM III MINIMI 1 MM 1 IIMMIMM II 1 MM MM 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 


185 


Db 


957 


1016 


Qy 


186 


TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 

1 MIMIIMM 1 1 Mill t 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 ; i Mill II II 1 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 


245 


Db 


1017 


1076 


Qy 


246 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 

1 II IIIIMM IMMMMMMI IIIIIIII MM Mllllllllllllll 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 


305 


Db 


1077 


1136 


Qy 


306 


CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 

1 Mill 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM 

CTGAGG GCGACTACTACGC CAAGGAGGT CA C C CG CGTG CTAATGGTGGAAAC C CACAACG 


365 


Db 


1137 


1196 


Qy 


366 


AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 

IIIIIIIIII II llllll 1 1 MUM IIIIIIII 1 IIIIIIII II 1 

AAATCTATGACAAGTTCAAGCAGAGTAGACACAGCATATATATGTTCTTCAACACATCAG 


425 


Db 


1197 


1256 


Qy 


426 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 


485 


Db 


1257 


IIIMM Mllllll II Mill II Mllllll IIIIIIIIIMIII IIIMM 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 


1316 


Qy 


486 


GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 

MMIMMIMM IMIIIIIIIIMIIIM llllll IMMMIII llllll 

GGCTCAAGTTAAAAGTGGAGCAGCAGGTGGAGCTGTACCAGAAATACAGCAAGAATTCCT 


545 


Db 


1317 


1376 


Qy 


546 


GGGGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 

MM IIIIIIIIIIIIIMIIIIIIII Mill Mill II llllll 1 II MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 


605 


Db 


1377 


1436 


Qy 


606 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 

IMMMMMMI 1 i Mill III Ml III II II Mill IIIMM 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 


665 


Db 


1437 


1496 


Qy 


666 


TCAGTGCGCAGTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 

1 M IIIIIIII 1 II 1 M II II 1 II II 1 1 1 II 1 1 II II II 1 II II II 1 II II 
TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 


725 


Db 


1497 


1556 


Qy 
Db 


726 
1557 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 

III 1 1 IIIIMM llllllllllllllllllll MMIMMIMM IIIMM 

TCACTACCGGCCGCCGAGGTGACCTGGCCAGCATTCATGGCATGAACCGGCCTTTCCTGC 


785 
1616 


Qy 


786 


TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

1 IIIMM Mill MMMIMMMIMM Mill 1 1 II II 1 1 1 1 1 II II II 1 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 


845 


Db 


1617 


1675 



Qy 84 6 CCCTGGATACC^VACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATA 905 

llllllll I 

Db 1676 GCCCTGGACA 1685 

Qy 9 06 CCAACTACTGCTTCAGCTCCAGGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 965 

11:11 M I i II I II I I M I I I I I I I I I I I ,1 M | | | M || Mil 

Db 1686 CCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACT 1745 

Qy 966 TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCT 1025 

I I I I llllllll IMIIIIIIIIIII II II IIIIMIIIIIIIIIIIIII I I I I 
Db 1746 TCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 18 05 

Qy 102 6 GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 1085 

I I I I llllllll llllllll llllllll Mill 1 II I I I I I I I I I I I I ,1 I II I 

Db 18 06 GCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCC 18 65 

Qy 1086 TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 114 5 

II I I 1 I II I I I ! I 1111111,111 , IMI i I ; I I I I II i I I 1 1 I I I II i I i I I 1 1 I I 

Db 18 66 TGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 1925 

Qy 114 6 AGCCA.CTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 12 05 

I I I I 1 1 1 1 J 1 1 1 : 1 ,1 1 1 1 ! 1 1 ! I M 1 1 1 J ! 1 1 1 1 ! 1 1 - 1 ! 1 1 1 1 : || | | I I | | | I I 

Db 1926 AGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 1985 

Qy 12 06 TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCGCACCC 12 65 

IIMIMIII IIIIMIIIIIIIIIIIIII I I I I I I : I M I I M I II I 

Db 1986 TGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAG 2 04 5 

Qy 1266 GGCAGGCCCGGCCCCAGCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA 1317 

I I Mill MM MM Ml II M II 1 1 1 1 II II 1 1 II M 1 1 1 

Db 204 6 GCCCGGCCCCACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACA 2105 

Qy 1318 TCGTG CCC CAAG C C CACTTGGGAT CG ATTAAA 134 9 

1 1 ! 1 1 ! 1 1 1 1 1 1 1 1 i I MM I I II 

Db 2106 CCGTGCCCCAAGCCCACCTGGGGCCCCATTAA 2137 



RESULT 4 
AAQ03301 

ID AAQ03301 standard; DNA; 2537 BP. 
XX 

AC AAQ033 01; 
XX 

DT 25-MAR-2003 (updated) 

DT 05-AUG-1990 (first entry) 

XX 

DE cDNA encoding human pre- transforming growth factor-beta - 1 (pre-TGF-beta - 

DE 1) . 

XX 

KW Transforming growth f actor-beta-1 (TGF-beta-1) ; 

KW neoplastic cell line inhibition; 

KW EGF-potentiated anchorage- independent growth; 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 842 . .2014 



FT /*tag= a 

FT mat_peptide 1676. .2011 

FT /*tag= b 

FT misc_dif f erence 37.. 113 

FT /*tag= c 

FT /note= M stable hairpin loops" 

FT misc_feature 2015.. 2100 

FT /*tag= d 

FT /note="G-C rich sequence 

FT and a downstream TATA- like sequence" 

XX 

PN US4886747-A. 
XX 

PD 12-DEC-1989. 
XX 

PF 13-MAR-1987; 87US - 0025423 . 
XX 

PR 13-MAR-1987; 87US - 002 5423 . 
XX 

PA (GETH ) GENENTECH INC. 



XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1990-051338/07. 

DR P-PSDB; AAR05258 . 
XX 

PT Nucleic acid encoding transforming growth factor-beta - 

PT cloned into expression vectors for expression in eukaryotic host 

PT cells for therapeutic use 

XX 

PS Disclosure; Fig lb; 28pp; English. 
XX 

CC It was obtained by an analysis of several overlapping cDNAs and gene 

CC fragments, leading to the detn. of a continuous sequence corresp. to the 

CC TGF-beta-1 precursor mRNA. It is useful in constructing vectors that 

CC encode biologically active transforming growth factor (TGF-beta) , 

CC operably linked to DNA that encodes a secretory leader (SL) . It, or a 

CC nucleic acid capable of hybridising with it, can also be labelled and 

CC used in diagnostic assays for DNA or mRNA encoding TGF-beta or related 

CC proteins. 

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 2537 BP; 473 A; 893 C; 739 G; 432 T; 0 other; 

Query Match 72.2%; Score 982; DB 11; Length 2537; 

Best Local Similarity 84.9%; Pred . No. 3e-189; 

Matches 1148; Conservative 0; Mismatches 145; Indels 59; Gaps 2; 

Qy 6 CCG AG ATGG CG CCTT CGGGG CTG CGG CT CTTG C CG CTG CTG CTG C CG CTG CTGTGG CTG C 65 

II III INI II IIIIIIMIII IMIIIIIIIIII llllllllllllll I 
Db 837 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 896 

Qy 66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

I llllllllllllllll lllllllll Mill llllllllllllll llllllllll 
Db 8 97 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 956 



Qy 12 6 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 



Db 


957 


Qy 


186 


Db 


1017 


Qy 


246 


Db 


1077 


Qy 


306 


Db 


1137 


Qy 


366 


Db 


1197 


Qy 


426 


Db 


1257 


Qy 


486 


Db 


1317 


Qy 


546 


Db 


1377 


Qy 


606 


Db 


1437 


Qy 


666 


Db 


1497 


Qy 


726 


Db 


1557 


Qy 


786 


Db 


1617 


Qy 


846 


Db 


1676 


Qy 


906 


Db 


1686 


Qy 


966 



1 1 1 1 1 1 1 1 II I II . 1 1 1 II , I II 1 1 1 1 1 1 IMIIIIIMI lllllllllll MM 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 1016 

TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 245 

I i I I 1 I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I II I I I I I I Mill M II I 
T CG C CAG C C C CC CGAG C CAGGGGG AGG TG C CG C C CGG C C CG CTG CCCGAGG CCGTG CT CG 1076 



II 



CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 3 65 

I Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 

AAATCTATGATAAATTCAAGGGCA.CCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 425 

IIIIIIIIII II MUM I I llllll I M M M I I II M II M II I 

AAATCTATGACAAGTTC^GC^GAGTAC^CAC^GCATATATATGTTCTTCAACACATCAG 12 56 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 85 

MMMI 1 1 1 1 1 1 1 1 II Mill II IIIIIIM llllllllllllll MIMM 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 L 1 1 ! 1 1 1 1 1 1 i 1 1 1 ) I MMMI llllll 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 1376 

GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACGGGAGTGGCTGTCCTTTG 605 

MM 1 1 1 1. 1 II 1 1 M I II: I II ! 1 1 1 Mill Mill II llllll I II MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 14 3 6 

ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

. I M 1 1 ' I II 1 1 1 1 1 1 1 1 III 1 1 1 1 1 III Ml III II II Mill MMMI 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 14 96 

TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 725 

I II IIIIIIM MMMI III IMIIIIIMI II IMM M Illllll 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 1556 

TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 785 

III I I IMIIMI IIIIIIMIIIIIIIIMM MM 1 1 1 1 1 1 1 1 1 Illllll 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1616 

TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I II III I I IIIIMIIMIMII II MM I IMM IIMIIIIIII llllll 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 1675 



Illllll 



CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 965 

Illllll II I II 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 II I III 1 1 1 1 1 M IIMIIMM 

CCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACT 174 5 
TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCT 1025 

MM llllllll IMMMMIMM II M IMM MM 



Db 



174 6 TCCGOUVGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 18 05 



Qy 102 6 GCCTGGGGCCGTGTCGGTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 108 5 

Mil MINIM IMIMII MINIM INN II II II II 1 1 1 II I II II 1 1 I 

Db 18 06 GCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCC 1865 

Qy 108 6 TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 114 5 

1 1 1 1 1 1 1 1 1 1 1 E 1 INIINNM Ml ! Ml M ,1 1 M 1 1 1 1 1 M 1 1 1 1 1 M Mi 1 1 

Db 186 6 TGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 1925 

Qy 114 6 AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 1205 

Mil 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 M 1 1 1 1 M 1 1 

Db 192 6 AGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGGTGTCCAACA 1985 

Qy 12 06 TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCC 12 65 

.LI I I I I I I I I I I I I I I I I I I I I lllllllllll II I II I 

Db 198 6 TGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAG 2 04 5 

Qy 1266 GGCAGGCCCGGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA 1317 

I I Mill MM INI III II II I III INI III II I III II 

Db 2 04 6 GCCCGGCCCCACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACA 2105 

Qy 1318 TCGTGCCCCAAGCCCAGTTGGGATCGATTAAA 134 9 

1 1 1 ' 1 1 1 1 1 1 1 1 1 1 1 1 INI I I II 

Db 2106 CCGTGCCCCAAGCCCACCTGGGGCCCCATTAA 2137 



RESULT 5 
AAQ02814 

ID AAQ02814 standard; cDNA; 2537 BP. 
XX 

AC AAQ02814; 
XX 

DT 25-MAR-2003 (updated) 
DT 31-OCT-2002 (updated) 
DT 31-MAY-1989 (first entry) 
XX 

DE Sequence of pre-TGF-betal cDNA. 
XX 

KW Transforming growth factor beta-3 (TGF beta 3); tumour cells; growth 

KW inhibition . 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 842 . .2011 

FT /*tag= a 

FT /label=pre-TGF beta 1 

FT CDS 1677 . .2011 

FT /*tag= b 

FT /label=mature TGF-beta 1 

FT GC_signal 2015.. 2092 

FT /*tag= c 

FT misc_feature 2093.. 2099 

FT /*tag= d 

FT /label =TATA- like sequence 

FT stem_loop 37.. 113 



FT /*tag= e 

FT misc_feature 863.. 911 

FT " /*tag= f 

FT /label=hydrophobic domain 
XX 

PN WO8912101-A. 
XX 

PD 14-DEC-1989. 
XX 

PF 08-JUN-1988; 8 8WO-US0194 5 . 
XX 

PR 08-JUN-1988; 8 8WO-US0 1 94 5 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Dernyck RMA, Goeddel DV; 
XX 

DR WPI; 1990-007474/01. 

DR P-PSDB; AAR04034. 
XX 

PT Nucleotide sequence encoding transforming growth factor beta-3 -used as a 

PT probe, or to produce TGF beta 3, for growth inhibition of certain normal 

PT and neoplastic cells, eg A549. 
XX 

PS Disclosure; Fig. lb; 61pp; English. 
XX 

CC Sequence encodes the 390 amino acid (AA) precursor transforming growth 

CC factor-beta 1 (pre-TGF-beta 1) polypeptide. The 5' untranslated region of 

CC the TGF-beta 1 mRNA is 841 bases long, is purine rich and has a region of 

CC potential secondary structure. The TATA- 1 ike sequence in the 3' untrans- 

CC lated region of the gene is presumably a polyadenylation signal. Mature 

CC TGF-beta 1 comprises the C- terminal 112 AA's of pre-TGF-beta 1 and is 

CC cleaved at the Arg-Arg dipeptide preceding its NH2 terminus. The nucleic 

CC acid encoding the second subtype of TGF-beta (TGF-beta 3) is useful as a 

CC probe or to produce TGF-beta 3 for inhibition of growth of normal and 

CC neoplastic cells. 

CC (Updated on 31-OCT-2002 to add missing OS field.) 

CC (Updated on 25-MAR-2003 to correct PR field.) 

CC (Updated on 25-MAR-2003 to correct PI field.) 
XX 

SQ Sequence 2537 BP; 473 A; 893 C; 739 G; 432 T; 0 other; 



Query Match 72.2%; Score 982; DB 11; Length 2537; 

Best Local Similarity 84.9%; Pred. No. 3e-189; 

Matches 1148; Conservative 0; Mismatches 145; Indels 59; Gaps 2; 

Qy 6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 

II III I I I I II IMIIIIMII IIIMIIIIIIII I I I I I I ' I I , I II I I 
Db 837 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 

Qy 66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

I Mill I 'I lllllll MINIMI Mill IMMIMIIMM MINIUM 

Db 897 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 956 

Qy 126 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 

M II 1 1 M I Ml I M II II Ml 1 1 1 1 II M Mill IMM II I MM MM 

Db 957 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 1016 



Qy 186 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I IIIIIIIIIIIIIIIIIIIMII llllllllllllllllllll Mill II II I 

Db 1017 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 

Qy 24 6 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II IIMIMI IIIIIIIIIIIIII IIIIIIII MM I M I : II I II 1 1 1 II I 

Db 1077 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 1136 

Qy 3 06 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

I Mill 1 1 1 1 1 1 IM 1 1 1 1 1 1 1 1 II, I IM 1 1 1 1 II I M 1 1 1 1 1 1 1 : II I MM 

Db 1137 CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 

Qy 3 66 AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 5 

MM II II MUM I I MUM IIIIIIII I IIIIIIII II I 

Db 1197 AAATCTATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAG 12 56 

Qy 42 6 AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 485 

IIIIIII IIIIIIII II Mill M IIIIIIII 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMIIII 

Db 1257 AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 

Qy 4 86 GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

1 1 1 1 1 1 i 1 1 1 1 1 E i 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 [ I 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 MUM 

Db 1317 GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 1376 

Qy 54 6 GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 605 

MM IIIIIIIMI IIIIIII MM Mill Mill II MMII I M MM 

Db 13 77 GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 143 6 

Qy 6 06 ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

[ I [ J 1 1 1 E ! 1 1 1 i 1 1 1 1 ] 1 1 E 1 1 1 j I III III III II II Mill MIMM 

Db 143 7 ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 14 96 

Qy 666 TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 725 

I II II M II II lllllllllllll lllllllllll II Mill II MIMM 

Db 14 97 TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 1556 

Qy 726 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 785 

III I I IIIIIIII M II M II M M M II II M IIIIIIIIIIIIII IMIIII 

Db 15 57 TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1616 

Qy 786 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I MIMIMMIII IIIIIII IIIMIIIII Mill 1 1 1 1 1 II I II MM I M I 

Db 1617 TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 1675 

Qy 84 6 CCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATA 9 05 

IIIIIIII I 

Db 1676 GCCCTGGACA 1685 

Qy 9 06 CC^CTACTGCTTCAGCTCO^CGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 965 

IMIIII 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 Ml II M 1 1 1 M 1 1 II 1 1 1 1 1 1 Mllllllll 

Db 168 6 CCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACT 174 5 

Qy 966 TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCT 102 5 

MM IIMIMI I IIIMIMIMI II M 1 1 1 1 II II 1 1 M 1 1 1 1 1 1 1 1 MM 

Db 1746 TCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 18 0 5 



Qy 102 6 GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 1085 

1 1 1 1 MINIM MMMM MMMM MMI MMMMMMMMMM I 

Db 18 06 GCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCC 1865 

Qy 1086 TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 114 5 

M M II 1 1 M II I I M M 1 1 1 1 M I M M 1 1 1 II M M M I M II I M I II I II 1 1 M 

Db 1866 TGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 1925 

Qy 114 6 AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 12 05 

1 1 1 1 I II 1 1 1 1 1 1 1 1 1 1 1 1 II II II M I II II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II M 1 1 

Db 1926 AGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 1985 

Qy 12 06 TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCC 12 65 

I II I MUM MMMMMMMMMM I II Ml MM II II I II I 

Db 1986 TGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAG 2045 
Qy 1266 GGCAGGCCCGGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA 1317 

I I Mill MM MM Ml II II MM MMMM Ml I Ml 

Db 2 04 6 GCCCGGCCCCACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACA 2105 

Qy 1318 TCGTGCCCCAAGCCCACTTGGGATCGATTAAA 134 9 

Ml Mill MM MM MM I III 

Db 2106 CCGTGCCCCAAGCCCACCTGGGGCCCCATTAA 2137 



RESULT 6 
AAT15720 

ID AAT15720 standard; cDNA; 2537 BP. 
XX 

AC AAT15720; 
XX 

DT 25-MAR-2003 (updated) 

DT 24-JUL-1997 (revised) 

DT 25-JAN-1980 (first entry) 
XX 

DE Pre- transforming growth factor beta 1 cDNA. 
XX 

KW transforming growth factor beta 1; wound healing; 

KW recombinant production; ss . 

XX 

OS Homo sapiens. 



XX 

FH Key Location/Qualifiers 

FT 5'UTR 1..841 
FT /*tag= a 

FT misc_feature 37.. 113 
FT /*tag= b 

FT /note= "GC-rich region forms stable hairpin loops; 

FT similar to structural organisation of c-myc RNA, 

FT could play role in mRNA stability or in 

FT regulation of transcription" 

FT CDS 842 . .2014 

FT /*tag= c 

FT /product = pre-TGF_beta_l 

FT mat_peptide 1676.. 2011 

FT /*tag= d 

FT /product= mature_TGF_beta__l 



FT 


iTPDPat - reoion 

^ v^. v_* ca o -i- i 


2015 . .2100 


FT 




/*tag= e 


FT 




/nntp= "HP-rich rpoion • nn^sihlv rp^nmmihlp for the 


FT 




fact 3 ' UTR of mRNA could not be cloned as cDNA; 


FT 




may be important for transcription efficiency" 


FT 


rpneat unit 


2019 2023 


FT 




/*taq= f 


FT 


TATA s igna 1 


2094 . .2100 


FT 






FT 




/nnf p= "TATA-like spnupnce* no pvidpnrp that" this 


FT 




functions a promoter" 


FT 


oolvA si anal 


2514 2520 


FT 




/*tag= h 


FT 


mi cr ^icrnal 

HUD 1 - ox^-jiicaj. 


J> J • . J JU 


FT 




/*tag= i 


FT 




/notp— "rnn^pn^u^ ^pnnpnrp immediatplv nrprpdpc! 


FT 




nnl vA -fail ( Rpnni ^ t pt a1] " 


XX 






PN 


US5482851 -A 




XX 






PD 


0 9 - JAN -19 96 




XX 






PF 


05-NOV-1993 ; 


93US-0147364 . 


XX 






PR 


13-MAR-1987; 


87US-0025423 . 


PR 


22-MAR-1985; 


85US-0715142 . 


PR 


04 -AUG- 198 9; 


89US-0389929. 


PR 


04-MAR-1992 ; 


92US-0845893 . 


PR 


05-NOV-1993; 


93US-0147364 . 


XX 






PA 


(GETH ) GENENTECH INC. 


XX 






PI 


Derynck RMA, 


Goeddel DV; 


XX 






DR 


WPI; 1996-076891/08. 


DR 


P-PSDB; AAR90827. 


XX 






PT 


New recombinant human transforming growth factor-beta prods. - produced 


PT 


using Chinese 


hamster ovary cells, for use in diagnostic applications 


PT 


or in therapy 




XX 






PS 


Example 3; Fig 


1; 2 6pp; English. 


XX 






cc 


The cDNA encodes the pre-transf orming growth factor (TGF) beta 1 protein 


cc 


The nucleotide 


sequence was obtd. by an analysis of several overlapping 


cc 


cDNAs and gene 


fragments. The DNA is useful for the recombinant 


cc 


production of 


TGF beta 1, which can be used in, e.g. wound healing. 


cc 


(Revised entry 


submitted to correct sequence analysis breakdown.) 


cc 


(Updated on 25 


-MAR-2003 to correct PF field.) 


XX 






SQ 


Sequence 2537 


BP; 473 A; 893 C; 739 G; 432 T; 0 other; 



Query Match 72.2%; Score 982; DB 17; Length 2537; 

Best Local Similarity 84.9%; Pred. No. 3e-189; 

Matches 1148; Conservative 0; Mismatches 145; Indels 59; Gaps 2; 
Qy 6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 



Db 


837 


Qy 


66 


Db 


897 


Qy 


126 


Db 


957 


Qy 


186 


Db 


1017 


Qy 


246 


Db 


1077 


Qy 


306 


Db 


1137 


Qy 


366 


Db 


1197 


Qy 


426 


Db 


1257 


Qy 


486 


Db 


1317 


Qy 


546 


Db 


1377 


Qy 


606 


Db 


1437 


Qy 


666 


Db 


1497 


Qy 


726 


Db 


1557 


Qy 


786 


Db 


1617 


Qy 


846 



II III 1 1 1 1 II IIIMIIIMI MM M IIIIIIIIIIIIII I 

CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 
TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 125 

I I i 1 1 1 M 1 1 1 1 1 1 1 1 IIIIIIIII Mill IIIIIIIIIIIIII IIIIIIIIII 

TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 956 
AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 185 

: Ml: I! II I lllllllllll IIIMIIIMI MM 

AG CTGG TGAAG CGG AAG CG CAT CGAGG C CAT C CG CGG C CAG ATC CTGT C CAAG CTG CGG C 1016 

TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I I I I I I 1 1 I I I I I I I I I I I I T : MIMMIMIMIMIIM Mill II II I 
TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 

CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II llllllll IIIIIIIIIIIIII llllllll MM IIIMIIIMI 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 1136 
CAGAGG CGGA CTACTA CG C CAAGGAGGT CAC C CG CGTG CTAATGGTGG AAAG CGG CAA C C 365 

I Mill 1 1 1 1 1 1 1 1 ! 1 1 ! 1 1 1 1 M 1 1 1 M 1 1 : 1 1 ' 1 1 1 M 1 1 1 1 1 1 1 i I MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCG CGTG CTAATGGTGGAAACCCA CAA CG 1196 
AAATCTATG ATAAATTCAAGGG CAC C C C C CA CAG CTTATATATG CTGTT CAACACGT CGG 425 

IIIIIIIII II II III I I MUM llllllll I 1 1 1 1 1 1 1 1 II I 

AAATCTATGACAAGTTCAAGCAGAGTACACAGAGCATATA 12 56 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 485 

lllllll MIIIMI II Mill II llllllll IIIIIIIIIIIIII lllllll 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 
GGCTCAAGTTAAAAGTGGAGCAGOVCGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

MINI Mill IMI I M 1 1 1 1 1 1 

GGCTC^GTTAAAAGTGGAGCAGCACGTGGAGCTGTACC^GAAATAC^GCAACAATTCCT 1376 
GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 605 

MM 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 M I Mill Mill II MUM I II MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 1436 
ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

II 1 1 1 II I M 1 1 1 1 1 M 1 1 IMI . Ill III III II II Mill lllllll 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 14 96 
TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 72 5 

I II llllllll MM Mill 1 1 1 1 1 II II II II Mill II lllllll 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 1556 
TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 785 

Ml I I MIIIMI 1 1 III M M Ml h II M I IIIIIIIIIIIIII lllllll 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1616 
TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I 1 1 1 1 1 Ml 1 1 III M I III M 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 i 1 1 1 1 1 ; 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 1675 
CCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATA 905 

MIIIMI I 



Db 



1676 



GCCCTGGACA 168 5 



Qy 


906 


Db 


1686 


Qy 


966 


Db 


1746 


Qy 


1026 


Db 


1806 


Qy 


1086 


Db 


1866 


Qy 


1146 


Db 


1926 


vy 


1 9 D 

-L \J D 


Db 


1986 


Qy 


1266 


Db 


2046 


Qy 


1318 


Db 


2106 



CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 965 

Illllll 1111:11 Mill IIIIIIIIIIMMIIIIIIIIMIII llllllllll 

CCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACT 174 5 

TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCT 1025 

I I I I MM IIIIIMIIIIIII I I II lllllillllllllllllli I I I I 
TCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 18 05 

GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 1085 

1 1 1 1 Mill lilllli; MMIMI Mill MM II 1 1 1 1 1 Ml 1 1 III I I 

GCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCC 18 65 
TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 114 5 

Illllllllllll MIIMI II I II I M II M I MM 1 1 ! MM I M Ml 1 1 II 

TGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 1925 
AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 12 05 

1 1 1 1 1 1 II MM III 1 1 ! I M I M MM I i II II I II Ml M II 1 1 1 II II M II Ml I 

AGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 1985 

TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCC 1265 

llllllllll lllllillllllllllllli I I I I I I 1 I I 1 I t II I II I 

TGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAG 2 04 5 



MINIMI 



TCGTGCCCCAAGCCCACTTGGGATCGATTAAA 134 9 

Illlllllllllllll I I I I I I II 
CCGTGCCCCAAGCCCACCTGGGGCCCCATTAA 2137 



RESULT 7 


AAQ56923 


ID 


AAQ56923 standard; cDNA; 2537 BP. 


XX 




AC 


AAQ56923; 


XX 




DT 


25-MAR-2 003 (updated) 


DT 


09-JUL-1994 (first entry) 


XX 




DE 


Human pre-TGF-beta-1 . 


XX 




KW 


TGF-beta-1; TGF-beta-2; transforming growth factor beta-1; 


KW 


transforming growth factor beta -3; recombinant; wound healing; 


KW 


vulnerary; ss. 


XX 




OS 


Homo sapiens . 


XX 




FH 


Key Location/Qualifiers 


FT 


misc_structure 47.. 113 


FT 


/*tag= a 


FT 


/note= "possible hairpin loop region" 


FT 


CDS 842.. 2014 



FT 




/★fan— H 


FT 


ma f r^^r^t~ i H ^ 
1 1 id u pep L xuc 


J_ O / D . . ^ U 1 1 


FT 




/ * t- aci~ r 


FT 


puiyn d lynai 


<6 Jl J • . ^£< _J ^. -L 


FT 




/*tag= d 


XX 






PN 






XX 






PD 


08 -FFR- 1994 




XX 






PF 


04-MAR-1992; 


92US-0845893 . 


XX 






PR 


22-MAR-1985; 


85US-0715142 . 


PR 


13-MAR-1987; 


87US-0025423 . 


PR 


04-AUG-1989; 


89US-0389929. 


PR 


04-MAR-1992; 


92US-0845893 . 


XX 






PA 


(GETH ) GENENTECH INC. 


XX 






PI 


Derynk RMA, 


Goeddel DV; 


XX 






DR 


WPI; 1994-056343/07. 


DR 


P-PSDB; AAR46227. 


XX 






PT 


Nucleic acid 


sequences encoding transforming growth factor-beta - 


PT 


diagnostic probes, and for use in therapeutics 


XX 






PS 


Disclosure; 


Fig lb; 25pp; English. 


XX 






CC 


cDNA sequences were determined for human pre-TGF-beta- 1 (AAQ56923) , 




pig TGF-beta 


-3 (AAQ56925) and human TGF-beta-3 (AAQ56926) , and the 


CC 


corresponding amino acid sequences were determined (AAR46227-29 , 


CC 


respectively) . A genomic fragment corresponding to a human TGF- 


CC 


beta -1 exon 


(AAQ56924) was also isolated and its amino acid sequence 


CC 


determined (AAR4 623 0) . The sequences have been used in the 


CC 


construction 


of vectors for the expression of recombinant TGF- 


CC 


beta . 




CC 


(Updated on 


25-MAR-2003 to correct PF field.) 


XX 






SQ 


Sequence 2537 BP; 473 A; 890 C; 742 G; 432 T; 0 other; 



Query Match 72 . 0%; 

Best Local Similarity 84.8%; 
Matches 1147; Conservative 



Score 980.4; DB 15; Length 2537; 
Pred. No. 6.2e-189; 
0; Mismatches 146; Indels 59; Gaps 



2; 



Qy 

Db 



837 



CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 

II III I I I I II lllllllllll lllllllllllll llllllllllllll I 
CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 



Qy 
Db 

Qy 

Db 



66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

I MINIMI L MINIMI Mill 1 1 1 1 1 1 1 ! I II 1 1 1 1 1 1 1 1 1 1 II I 

8 97 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 956 
12 6 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 185 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I M 1 1 1 lllllllllll MM 

957 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 1016 



Qy 


186 


Db 


1017 


Qy 


246 


Db 


1077 


Qy 


306 


Db 


1137 


Qy 


366 


Db 


1197 


Qy 


426 


Db 


1257 


Qy 


486 


Db 


1317 


Qy 


546 


Db 


1377 


Qy 


606 


Db 


1437 


Qy 


666 


Db 


1497 


Qy 


726 


Db 


1557 


Qy 


786 


Db 


1617 


Qy 


846 


Db 


1676 


Qv 


906 


Db 


1686 


Qy 


966 


Db 


1746 


Qy 


1026 



TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I I i I II I! I I II I II I I I I I I I I I llllllllllllllllllll Mill II II I 
TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 

CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 305 

I II llllllll llllllllllllll llllllll I I I I MM lllllilllll 
CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACGGGAGCCCGAGC 1136 

CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

I Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M : 1 1 1 1 1 1 1 ,1 1 ! 1 1 ! 1 1 1 1 1 1 I MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 
AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 425 

llllllllll II MUM I I llllll IIIIMM I llllllll II I 

AAATCTATGACAAGTTC^GC^GAGTACACAC^GC^TATATATGTTCTTC^CACATCAG 12 56 
AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 85 

lllllll IIIIMM II Mill II llllllll MM IMIMM lllllll 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 
GG CT CAAGTTAAAAGTGGAG CAG CACGTGGAG CTATAC CAGAAATACAG CAATGATT C CT 54 5 

I ! M 1 1 1 1 : 1 II ! 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M I M I i 1 1 llllll 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 1376 



III 1 1 1 M I II I II II 1 1 M 1 1 M 1 1 Mill Mill II llllll I M MM 



ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

1 1 M 1 1 1 II Ml I II III 1 1 II I II III III III II II Mill lllllll 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 14 96 
TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 72 5 

I II IMIMM lllllllllllll llllllllll II Mill II lllllll 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 1556 

TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 78 5 

III I I llllllll llllllllllllllllllll IIIIIIMIIIIII lllllll 
TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1616 

TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I MIMI MINIMUM IIMIIIIIIII Mill 1 1 1 1 1 1 1 II II 1 1 1 1 1 II 

TTCTCATGGCCACCCGGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 167 5 

CCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATA 905 

llllllll I 



1685 



CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 965 

lllllll I I I I I ! I I I : I I I I I I I I I ! I I I I I I . . I ' ' ' I I llllllllll 
CCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACT 174 5 

TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCT 1025 

MM llllllll I II 1 1 1 I ! I I f 1 1 1 II II I I MMIII I 'Ml I 1 1 1 

TCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 18 05 
1026 GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 1085 



1 1 1 1 I MINI Nihil MINIM MMI M 1 1 1 1 1 M M 1 1 1 1 1 II 1 1 I 

Db 18 06 GCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCC 1865 

Qy 1086 TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 114 5 

M M M I M 1 1 M M M II 1 1 M I 1 1 II 1 1 1 1 1 II II M 1 1 1 II II 1 1 1 1 M I II I II 

Db 18 66 TGTACAAC CAG CATAA C CCGGG CGC CTCGG CGG CG CCGTG CTG CGTG C CG CAGG CG CTGG 1925 

Qy 114 6 AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 12 05 

MM 1 1 1 1 1 1 1 ! I M 1 1 1 1 1 M M 1 1 II M 1 1 1 ! 1 1 II I M 1 1 1 1 1 ; 1 1 M 1 1 1 M 

Db 1926 AGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 1985 

Qy 12 06 TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCC 12 65 

MM MM II II Mllllllll II I II Ml III II MM III Mill I 

Db 1986 TGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAG 2 04 5 
Qy 1266 GGCAGGCCCGGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA 1317 

I I Mill MM MM Ml II II M 1 1 M 1 1 1: 1 M 1 1 1 1 II 

Db 2 04 6 GCCCGGCCCCACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACA 2105 

Qy 1318 TCGTGCCCCAAGCCCACTTGGGATCGATTAAA 134 9 

I 1 1 1 1 1 i 1 1 1 1 1 1 MM I I M 

Db 2106 CCGTGCCCCAAGCCCACCTGGGGCCCCATTAA 2137 



AAV52 933; 

25-MAR-2 003 (updated) 
21-DEC-1998 (first entry) 

Human pre-transf orming growth factor-beta 1 cDNA. 
Transforming growth factor-beta 1; TGF-beta 1; human; ss. 
Homo sapiens . 



RESULT 8 
AAV52933 

ID AAV52933 Standard; cDNA; 2537 BP. 
XX 
AC 
XX 
DT 
DT 
XX 
DE 
XX 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
XX 
PN 
XX 
PD 
XX 



Key 
CDS 

mat_pept ide 
stem_loop 

misc_f eature 

polyA_signal 

US5801231-A. 
01-SEP-1998. 



Location/Qualifiers 
842 . .2014 
/*tag= a 
1676. .2011 
/*tag= b 
37. .113 
/*tag= b 

/note= "putative stable hairpin loop" 
2015. .2100 
/*tag= c 

/note= "GC-rich sequence" 
2514 . .2520 
/*tag= d 



PF 30-MAY-1995; 95US- 04 54468 . 
XX 

PR 13-MAR-1987; 87US- 0025423 . 

PR 22-MAR-1985; 8 5US - 07 1 5 14 2 . 

PR 04-AUG-1989 ; 8 9US- 038 9929 . 

PR 04-MAR-1992; 92US - 084 58 93 . 

PR 05-NOV-1993; 93US- 0147364 . 

PR 30-MAY-1995; 95US - 04 544 68 . 
XX 

PA (GETH ) GENENTECH INC . 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1998-494840/42. 

DR P-PSDB; AAW78785 . 
XX 

PT DNA encoding transforming growth factor-beta precursor sequence - 

PT useful for analysis to perform manipulations to increase yield of 

PT recombinant production of the protein 
XX 

PS Example 3; Fig IB 1-3; 26pp; English. 
XX 

CC This nucleotide sequence codes for the human transforming growth 

CC factor-beta 1 precursor (preTGF-beta 1, see AAW78785) . It is a 

CC composite of overlapping cDNA clones isolated from different cDNA 

CC libraries (placenta, A172 glioblastoma, HT1080 f ibroblastoma) using 

CC TGF-beta exon (see AAV52936) restriction fragments as probes. 

CC The 3 ' region of the sequence was determined using cloned genomic 

CC DNA . The invention relates to the recombinant production of 

CC TGF-beta. Biologically active TGF-beta is defined as being capable 

CC of inducing EGF-potent iated anchorage independent growth of target 

CC cell lines and/or growth inhibition of neoplastic cell lines. 

CC Nucleic acids encoding TGF-beta have been isolated and cloned into 

CC vectors which are replicated in bacteria and expressed in 

CC eukaryotic cells. TGF-beta recovered from transformed cells is 

CC used in known therapeutic applications. TGF-beta nucleic acids are 

CC also useful in diagnosis and identification of TGF-beta clones. 

CC (Updated on 25-MAR-2003 to correct PF field.) 

XX 

SQ Sequence 2537 BP; 475 A; 895 C; 736 G; 431 T; 0 other; 



Query Match 71.8%; Score 977.2; DB 19; Length 2537; 

Best Local Similarity 84.7%; Pred . No. 2.8e-188; 

Matches 1145; Conservative 0; Mismatches 148; Indels 59; Gaps 2; 
Qy 6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 

II III 1 1 1 1 II lllllllllll 1 1! 1 1 1 1 1 1 1 1 1 1 MINIMI MM I 

Db 837 CCCCCATG C CG CC CTC CGGG CTG CGG CTG CTG CCG CTG CTG CTA C CG CTG CTG TGG CTA C 8 96 

Qy 66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

I I IMM IMMIII I I IMM 1 1 1 1 1 1 1 1 1 1 1 , 1 MINIMI 

Db 8 97 TGGTGCTGACGCCTGGCCCGCCGGCCCCGGGACTATCCACCTGCAAGACTATCGACATGG 956 

Qy 126 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 

Ml 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 lllllllllll lllllllllll MM 

Db 957 AGCT^GGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 1016 



Qy 


186 


Db 


1017 


Qy 


246 


Db 


1077 


Qy 


306 


Db 


1137 


Qy 


366 


Db 


1197 


Qy 


426 


Db 


1257 


Qy 


486 


Db 


1317 


Qy 


546 


Db 


1377 


Qy 


606 


Db 


1437 


Qy 


666 


Db 


1497 


Qy 


726 


Db 


1557 


Qy 


786 


Db 


1617 


Qy 


846 


uu 


1 b / b 


Qy 


906 


Db 


1686 


Qy 


966 


Db 


1746 



TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I 1 1 II 1 1 1 1 II 1 1 1 1 M 1 1 M 1 1 M III II lllllll Mill II II I 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 

CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II IIIIIIII llllllllllllll IIIIIIII I I I I M I I I I II I I I II I II 
CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 1136 

CAG AGG CGG A CT ACTA CG C CAAGG AGGTCA C CCG CG TG CTAATGG TGGAAAG CGG CAA C C 365 

I lllll II II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1; 1 1 1 1 1 I 1 1 1 1 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 
AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 425 

IIIIIIIIII II IIIIII I I MINI IIIIIIII I IIIIIIII II I 

AAATCTATGACAAGTTCAAGCAGAGTACACACAGCATAT^ 12 56 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 5 

lllllll IIIIIIII II Mill II IIIIIIII llllllllllllll lllllll 
AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

II lllllllllllllllll IIIIIIIMIII II IIMIIIIIIIII IIIIII 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 1376 
GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 6 05 

1 1 1 1 1 1 1 1 1 II 1 1 II ' 1 1 1 1 II 1 1 1 1 ! Mill Mill II IIIIII I 1 1 MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 143 6 
ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

1 1 Ml 1 1 1 1 1 II 1 1 1 M 1 1 1 II 1 1 1 III III Ml I II Mill lllllll 

ATGTCACCGGAGTTGTG CGG CAGTGGTTGAG CCGTGGAGGAGAAATTGAGGG CTTTCGCC 14 96 
TCAGTGCCCACTGTTCCTGTGACAGC^^ 72 5 

I II IIIIIIII IIMIIIIIIIII lllllll I II Mill II lllllll 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAA(^ 1556 

TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 785 

III I I IIIIIIII llllllllllllllllllll I 1 I I 1 I i I I I I I I I I II II II 
TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1616 

TCCTC^TGGCC^CCCCGCTGGAGAGGGCCC^GCACCTGCACAGCTCCCGGCACCGCCGAG 8 4 5 

I 1 1 1 1 M I II I III 1 1 1 1 II I II 1 1 1 M 1 1 1 Mill I M I II 1 1 1 1 1 M 1 1 1 1 1 

TTCTCATGGCC^CCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 1675 

CCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATA 905 

IIIIIIII I 



1685 



CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 965 

lllllll 1 1 M II 1 1 1 1 1 1 1 M I II Ml 1 1 1 1 II 1 1 II 1 1 1 1 1 1 M I IIIIIIIIII 

CC^ACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACT 174 5 

TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCT 1025 

MM IIIIIIII llllllllllllll II II 1 I i I. MM 

TCCGO^AGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 18 05 



Qy 



102 6 GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 108 5 



I II I MINIM MMMM I MM Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 I 

Db 18 06 GCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGA(^CGCAGTACAGCAAGGTCCTGGCCC 18 65 

Qy 108 6 TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 114 5 

Illllilllllll lllllllllll 1 1 M 1 1 1 II M M M 1 1 1 1 1 II 1 1 1 M 1 1 1 1 II 

Db 1866 TGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 1925 

Qy 114 6 AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 12 05 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 1926 AGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGG\GCTGTCCAACA 1985 

Qy 1206 TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCC 12 65 

II Mill MIMIIIIIIIIIIIIIII IIIIIMM II II I II I 

Db 1986 TGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAG 204 5 
Qy 12 66 GGCAGGCCCGGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA 1317 

I I Mill MINIM III II II Mill MM III Mill II 

Db 2 04 6 GCCCGGCCCCACCCCGCCCCGGCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACA 2105 

Qy 1318 TCGTGCCCCAAGCCCACTTGGGATCGATTAAA 134 9 

I III 1 1 Ml 1 1 II I II MM I I M 

Db 2106 CCGTGCCCCAAGCCCACCTGGGGCCCCATTAA 2137 



AAQ03268; 

25-MAR-2 0 03 (updated) 
12-AUG-1990 (first entry) 

Simian transforming growth factor-beta cDNA. 
Transforming growth factor-beta; psoriasis; TGF-beta; ss. 
Monkey . 



RESULT 9 
AAQ03268 

ID AAQ03268 standard; DNA; 1561 BP. 
XX 
AC 
XX 
DT 
DT 
XX 
DE 
XX 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
FT 
FT 
FT 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
XX 
PA 
XX 
PI 



Key 

sig_peptide 
mat_peptide 



EP353772-A. 
07-FEB-1990 . 

04 - AUG- 198 9; 

05- AUG-1988; 



Location/ Qual if iers 
283 . .324 
/*tag= a 
1096. .1431 
/*tag= b 

/product =human transforming growth factor-beta 



89EP-0114458 . 
88US-0229133 . 
(ON CO ) ONCOGEN LP. 

Twardzik DR, Purchio AF, Ranchalis JE, Stevens V; 



XX 

DR WPI; 1990-038499/06. 

DR P-PSDB; AAR03743 . 
XX 

PT Inhibition of proliferation of epidermal cells - 

PT used to treat psoriasis by contacting cells with compositions 

PT containing transforming growth factor-beta. 

XX 

PS Disclosure; fig 1; 20pp; English. 
XX 

CC TGF-beta may be used in the treatment of hyperplasia 

CC associated with acanthosis -categorised skin diseases, and 

CC in alleviating psoriatic symptoms associated with cytokine- 

CC induced phenomena. See also AAQ03269 and AAR03750. 

CC (Updated on 25-MAR-2003 to correct PA field.) 

XX 

SQ Sequence 1561 BP; 301 A; 547 C; 446 G; 267 T; 0 other; 

Query Match 71.6%; Score 974.2; DB 11; Length 1561; 

Best Local Similarity 85.0%; Pred . No. l.le-187; 

Matches 1150; Conservative 0; Mismatches 143; Indels 60; Gaps 3; 

C CGAGATGG CG C CTT CGGGGCTG CGG CTCTTG C CG CTGCTG CTG CCG CTG CTGTGGCTGC 6 5 

II III 1 1 1 1 II MINIUM, 1 1 1 1 1 1 1 1 1 1 1 1 1 llllllllllllll I 



I IIIIIIIIIMI 1 1 1 1 1 1 1 1 1 E 1 1 1 Mill llllllllllllll M III I 

TGGTGCTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATGG 
AG CTGGTGAAG CGGAAG CG CATCG AGG CCATT CG CGG CCAG ATT CTGT C CAAG CTT CGG C 

IMIIMIMM II MMM M 1 1 1 1 lllllllllll lllllllllll 1 1 II 

AG CTGGTGAAG CGGAAGCGCATCGAGAC CATC CG CGG CCAGATCCTGTC CAAG CTG CGG C 
TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 

I 1 1 1 Ml I M I M 1 1 II I M 1 1 1 1 I Ml 1 1 1 1 II I Ml 1 1 1 1 1 1 Mill M II I 



CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II MMM 1 1 1 M 1 1 1 1 1 1 1 1 I MMM MM II lllllllllll I 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAAC 556 
CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

I Mill 1 1 1 1 1 : 1 1 1 1 1 1 1 1 ! 1 1 1 i I !: 1 1 1 i 1 1 1 1 1 1 ; I I MM 

CGGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 616 
AAATCTATGATAAATTCAAGGGCACCCCCC^C^GCTTATATATGCTGTTCAACACGTCGG 425 

MMM' M MMM I I I MMM MMM I MM II I 

AAATCTATGA CAAG TT CAAG GAG AG CACACAG^ 676 
AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 5 

.MM Mill II II Mill 1 1 1 1 1 1 1 1 llllllllllllll Mill 

AGCTCCGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 736 
GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMM 

GGCTC^GTTAAAAGTGGAGC^GC^TGTGGAGCTGTACCAGAAATACAGC^a^TTCCT 7 96 



Qy 


6 


Db 


257 


uy 


O D 


Db 


317 


Qy 


126 


Db 


377 


Qy 


186 


Db 


437 


Qy 


246 


Db 


497 


Qy 


306 


Db 


557 


Qy 


366 


Db 


617 


Qy 


426 


Db 


677 


Qy 


486 


Db 


737 



Qy 


546 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 

INI IIIIIIIIMIIIMIIIIIIII Mill INI MINIMI MM MM 

GG CGATA C CT CAG CAA C CGG CTG CTGG CG CCCAG CAACTCGC CGGAGTGGTTGTCTTTTG 


605 


Db 


797 


856 


Qy 


606 


ATGT CA C CGG AG TTGTG CGG CAGTGG CTGAC CCG CAG AG AGG CTAT AG AGGGTTTT CG CC 

MMMMMMIMMIIMIMM III MM Ml II II MMI MIMII 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCC 


665 


Db 


857 


916 


Qy 


666 


TCAGTGCC(^CTGTTCCTGTGAC7\GCAAAGATAAC^C^CTCC^CGTGGAAATTAACGGGT 

1 II Illlllll MMMMMMMIMMMIMM II Mill II MIMII 

TTAGCGCCCACTGCTCCTGTGACAGCAAAGATAACACACTGCAAGTGGACATCAACGGGT 


725 


Db 


917 


976 


Qy 


726 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 

III 1 1 MINIM llllllllllllll INN llllllllllllll MINI 

TCACTACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGC 


785 


Db 


977 


1036 


Qy 


786 


TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

1 NNINNNIN llllllllllllll II Mill NNMNNNNNI 

TTCTCATGGCCACCCCACTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGA- 


845 


Db 


1037 


1095 


Qy 


846 


CCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATA 

1 1 1 1 1 1 1 1 1 

GCCCTGGACA 


905 


Db 


1096 


1105 


Qy 

Db 


906 
1106 


CCAACTACTGCTTCAGCTCC^CGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 

MINI : 1 II 1 1 1 1 1 1 ; 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 ; 1 1 1 1 1 1 1 1 1 1 1 II MIMII 

CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTATATTGACT 


965 
1165 


Qy 


966 


TCCGGAAGGACCTGGGCTGGAAGTGGATTC^TGAACCO^GGGCTACCATGCCAATTTCT 

MM Illlllll llllllllllllll II II MIMMMINNIMIM MM 
TCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 


1025 


Db 


1166 


1225 


Qy 


1026 


GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGAGA.CTCAGTAC^GCAAGGTCCTGGCTC 

1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 f 1 1 Illlllll INN II 1 II Ml MM M 1 1 1 ; 1 1 

GCCTGGGGCCCTGTCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCC 


1085 


Db 


1226 


1285 


Qy 


1086 


TGTACAACCA.GCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 

iiiNiiiiiiii Minn 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : i 1 1 1 1 

TGTAC^ACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 


1145 


Db 


1286 


1345 


Qy 


1146 


AGCC^CTGCC(^TCGTGTACTACGTGGGCCGC^GCCC^AGGTGGAGC^GCTGTC(^CA 

1 1 II II 1 II 1 Ml M M 1 1 1 1 1 1 III II ; M III 1 1 1 M II II 1 1 Ml 1 Ml 1 1 M 1 1 1 

AGCCACTGCCC^TCGTGTACTACGTGGGCCGC^GCCC^GGTGGAGC^GCTGTCCAACA 


1205 


Db 


1346 


1405 


Qy 


1206 


TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCC 

1 1 1 1 1 1 II 1 1 Illlllll 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II II 1 II 1 
M II 1 1 II II Illlllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 M 1 

TGATCGTGCGCTCCTGCAAATGCAGCTGAGGCCCCGCCCCGCCCCGCCCCACCCCGGCAG 


1265 


Db 


1406 


1465 


Qy 


1266 


GGCAGGCCCGGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA 

1 1 Mill IMIIMIN III II 1 MMINNINNNIN 
GCCCGGCCCCGCCCCACCCCACCCCCGCTGTCTTGCCCTTGGGGGCTGTATTTAAGGACA 


1317 


Db 


1466 


1525 


Qy 


1318 


-TCGTGCCCCAAGCCCACTTGGGATCGATTAAA 134 9 

Mill MM II Mill 1 II 

CCCGTGCCCCAAGCCCACCTGGGGCCCCATTAA 1558 




Db 


1526 





RESULT 10 
AAI58342 

ID AAI58342 standard; cDNA; 2742 BP. 
XX 

AC AAI58342; 
XX 

DT 22-OCT-2001 (first entry) 
XX 

DE Human polynucleotide SEQ ID NO 545. 
XX 

KW Human; nootropic; immunosuppressant; cytostatic; gene therapy; cancer; 

KW peripheral nervous system; neuropathy; central nervous system; CNS; 

KW Alzheimer's; Parkinson's disease; Huntington's disease; haemostatic; 

KW amyotrophic lateral sclerosis; Shy-Drager Syndrome; chemotactic; 

KW chemokinetic; thrombolytic; drug screening; arthritis; inflammation; 

KW leukaemia ; ss . 

XX 

OS Homo sapiens. 
XX 

PN WO200153312-A1 . 
XX 

PD 26-JUL-2001. 
XX 

PF 26-DEC-2000; 2 0 00WO-US34263 . 
XX 

PR 21-JAN-2000; 2 000US- 0488725 . 

PR 25-APR-2000; 2 0 0 0US- 05523 17 . 

PR 09-JUL-2000; 2000US-0598042 . 

PR 19-JUL-2000; 2 000US-06203 12 . 

PR 03-AUG-2000; 2 000US- 0653450 . 

PR 14-SEP-2000; 2 0 0 0US- 0662 1 9 1 . 

PR 19-OCT-2000; 2 000US- 0693 036 . 

PR 29-NOV-2000; 2 000US - 0727344 . 
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Liu C, Asundi V, Chen R, Ma Y, Qian XB, Ren F, Wang D; 

PI Wang J, Wang Z, Wehrman T, Xu C, Xue AJ, Yang Y, Zhang J; 

PI Zhao QA, Zhou P, Goodrich R, Drmanac RT; 
XX 

DR WPI; 2001-442253/47. 

DR P-PSDB; AAM39186. 
XX 

PT Novel nucleic acids and polypeptides, useful for treating disorders 

PT such as central nervous system injuries - 

XX 

PS Claim 1; SEQ ID NO 545; 10078pp ; English. 
XX 

CC The invention relates to human nucleic acids (AAI 57798 -AAI 61369 ) and 

CC the encoded polypeptides (AAM3 8642 -AAM4 22 13 ) with nootropic, 

CC immunosuppressant and cytostatic activity. The polynucleotides are useful 

CC in gene therapy. A composition containing a polypeptide or polynucleotide 

CC of the invention may be used to treat diseases of the peripheral nervous 

CC system, such as peripheral nervous injuries, peripheral neuropathy and 

CC localised neuropathies and central nervous system diseases, such as 

CC Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic 



CC lateral sclerosis, and Shy-Drager Syndrome. Other uses include the 

CC utilisation of the activities such as: Immune system suppression, 

CC Activin/inhibin activity, chemotact lc/chemokinet ic activity, haemostatic 

CC and thrombolytic activity, cancer diagnosis and therapy, drug screening, 

CC assays for receptor activity, arthritis and inflammation, leukaemias and 

CC C.N.S disorders. 

CC Note: The sequence data for this patent did not form part of the printed 

CC specification. 

XX 

SQ Sequence 2742 BP; 526 A; 938 C; 799 G; 479 T; 0 other; 

Query Match 71.5%; Score 972.6; DB 22; Length 2 742; 

Best Local Similarity 84.9%; Pred. No. 2.4e-187; 

Matches 1149; Conservative 0; Mismatches 144; Indels 60; Gaps 3; 

Qy 6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 

II III I I I I II lllllllllll MIMIIIIIIII Mllllllllllll I 
Db 837 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 

Qy 66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 125 

I 1 1 1 1 1 1 1 1 1 1 1 1 i II 1 1 M 1 1 1 1 1 1 1 Mill Mllllllllllll MINIM 

Db 8 97 TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 956 

Qy 126 AGCTGGTGAAGCGGAAGCGGATCGAGGCC^TTCGCGGCCAGATTCTGTCCAAGCTTCGGC 185 

II MIMIIIIII Mill I Ml I lllir I lllllllllll lllllllllll MM 

Db 957 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 1016 

Qy 186 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I II I II I II 1 1 1 II IM 1 1 II 1 1 1 IM Ml I I MM Ml II IMM II II I 

Db 1017 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 

Qy 246 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II 111:111 1 Mllllllllllll 1 1 j 1 1 1 1 1 MM 1 1 1 1 1 1 i 1 1 1 1 1 1 1 i I 

Db 1077 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 1136 

Qy 3 06 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

I Mill I M I M I Ml M 1 1 1 1 1 M 1 1 1 1 III I II IM 1 1 M M I III I I MM 

Db 1137 CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 

Qy 366 AAATCTATGATAAATTC^GGGC^CCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 5 

IMM III M MMM I I MM I MM I I MM II I 

Db 1197 AAATCTATGACAAGTTCAAGCAGAGTACACACAGCATAT^ 12 56 

Qy 4 26 AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 485 

MMM lllllll II Mill II MM II ] I j 1 1 M 1 1 1 1 1 1 1 MMM 

Db 12 57 AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 

Qy 4 86 GGCTCAAGTTAAAAGTGGAGC^G(^CGTGGAGCTATAC(^GAAATA(^G(^TGATTCCT 54 5 

M M I II 1 1 1 1 1 1 1 IM I M 1 1 M I MM I IM I III 1 1 1 1 1 1 1 1 II 1 1 1 MMM 

Db 1317 GGCTC^GTTAAAAGTGGAGC^GCACGTGGAGCTGTACC^GAAATAC^GC^C^TTCCT 1376 

Qy 54 6 GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 605 

MM M I M II I M III 1 1 1 1 1 1 1 1 M Mill Mill M MMM I II MM 

Db 1377 GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 1436 

Qy 606 ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

IIMIIIIIIMIM lllllllll III III III II II Mill lllllll 



Db 



1437 



ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 



1496 



Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



666 TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 725 

I II Illlllll Illllllllllll lllllllllll II Mill II llllll 

14 97 TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 1556 

72 6 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 78 5 

III I I MMMM 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 MMMMMMII 1 1 1 1 1 1 1 

1557 TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1616 



786 



845 



TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

I 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II ! 1 1 1 E 1 1 1 1 1 Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1617 TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 1675 



846 



905 



CCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATA 

Illlllll I 

1676 GCCCTGGACA 1685 



906 



965 



CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 

llllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! II ! 1 1 1 1 1 

1686 CCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACT 1745 
966 TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCGAA.GGGCTACCATGCCAATTTCT 1025 

MM MMMM MMMMMMII II II I II 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 MM 

174 6 TCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 1805 
102 6 GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 108 5 

MM MMMM MMMM Illlllll Mill I II II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 I 

1806 GCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCC 18 65 
1086 TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 114 5 

Illllllllllll lllllllllll 1 1 1 1 ! I M I M 1 1 1 1 1 1 1 1 1 1 II I II I 1 1 1 1 M 

1866 TGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 1925 
114 6 AGCC^CTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA. 12 05 

mi mimmmmiimmiiiiiiiiimiiiimmmmiiiimiimiimi 

1926 AGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 1985 
12 06 TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCC 1265 

MMMM III Mill 1 1 1 1 1 M M Mill, II II II I II I 

1986 TGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAG 2 04 5 
1266 GGCAGGCCCGGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA 1317 

I I Mill MM MM III II II lllllllllllllllllll 

2 04 6 GCCCGGCCCCACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACA 2105 
1318 -TCGTGCCCCAAGCCCACTTGGGATCGATTAAA 134 9 

L 1 1 II 1 1 1 1 1 1 1! M MM I I II 

2106 CCCGTGCCCCAAGCCCACCTGGGGCCCCATTAA 2138 



RESULT 11 
AAQ2 02 8 9 

ID AAQ20289 standard; cDNA; 1559 BP. 
XX 

AC AAQ2 028 9; 
XX 



DT 


25-MAR-2003 


(updated) 


DT 


16-APR-1992 


(first entry) 


XX 






DE 


Sequence encoding simian transforming growth factor (TGF) beta-1. 


XX 






KW 


HvriPTf prm i on 


therapy; hypotensive agent; blood pressure modulator; 


KW 


ss . 




XX 






OS 


MnnVpv 




XX 






FH 


Key 


Lor a t ion / Oua 1 i f i ers 


FT 


CDS 


262 282 

i-j \J <£j * * vj 4-J 


FT 




/*tag= a 


FT 




283 . . 324 


FT 




/*tag= b 


FT 


CDS 


325 . .1098 


FT 




/*tag= c 


FT 


mat_peptide 


1099. . 1436 


FT 




/*tag= d 


XX 






PN 


W09119513-A. 




XX 






PD 


26-DEC-1991 . 




XX 






PF 


20-JUN-1991; 


91WO-US04449. 


XX 






PR 


20-JUN-1990; 


90US-0541221 . 


XX 






PA 


(BRIM ) BRISTOL-MYERS SQUIBB CO. 


XX 






PI 


Oleson FB, 


Comereski CR; 


XX 






DR 


WPI ; 1992-024199/03. 


DR 


P-PSDB; AAR20124. 


XX 






PT 


Use of transforming growth factor (TGF) -beta and their 


PT 


antagonists 


- for modulating blood pressure, for treating 


PT 


hypertension 


and hypotension 


YY 
/V./Y 






PS 


Disclosure; 


Fig 1; 4 2pp; English. 


XX 






cc 


A new method 


for treating hypertension comprises administering a 


cc 


transforming growth factor (TGF) -beta to an individual at a dose 


cc 


effective for lowering blood pressure; the TGF-beta may be e.g. 


cc 


mature TGF-beta, TGF-beta2, a mature TGF-betal/beta2 hybrid, TGF- 


cc 


betal precursor, a latent TGF-beta2 precursor, hybrid TGF-beta 1 /TGF - 


cc 


beta2 precursor, a latent TGF-betal complex or a latent TGF-beta2 


cc 


complex . 




cc 


(Updated on 


25-MAR-2003 to correct PA field.) 


XX 






SQ 


Sequence 1559 BP; 300 A; 546 C; 446 G; 267 T; 0 other; 



Query Match 71.4%; Score 971.8; DB 13; Length 1559; 

Best Local Similarity 85.1%; Pred. No. 3.2e-187 ; 

Matches 1146; Conservative 0; Mismatches 142; Indels 59; Gaps 3; 



Qy 



1 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 7 0 



Db 


261 


Qy 


71 


Db 


321 


Qy 


131 


Db 


381 


Qy 


191 


Db 


441 


Qy 


251 


Db 


501 


Qy 


311 


Db 


561 


Qy 


371 


Db 


621 


Qy 


431 


Db 


681 


Qy 


491 


Db 


741 


Qy 


551 


Db 


801 


Qy 


611 


Db 


861 


Qy 


671 


Db 


921 


Qy 


731 


Db 


981 


Qy 


791 


Db 


1041 



III MM II Illllllllll 1 1 1 1 M 1 1 II 1 1 1 ,1 1 1 1 1 1 1 1' 1 1 1 1 M Ml 

ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 32 0 
CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 13 0 

llllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 

CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 38 0 
GTGAAG CGG AAG CGCAT CGAGG CCATTCG CGGCCAGATTCTGTCCAAG CTT CGG CTTG CC 190 

IIIIIIIIIIMIMIIIIII MM IMMMIMI I MINI : I I Mill Ml 

GTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 44 0 
AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 250 

IIIIIIIIMIIIIIIIIII 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 1 1 M II II 

AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 5 00 
TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 310 

llllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 llllllll MM II Illllllllll II III 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAACCGGAG 560 
GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 3 7 0 

II IIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIM I MM Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 62 0 
TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 43 0 

Mill II MUM I I I llllll llllllll I llllllll II Mllll 

TATGAC^GTTCAAGCAGAGCACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 68 0 
CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 90 

II Mill II II llllllll llllllll MM, I llllllllllll 

CGAGAAG CAGTAC CTG AAC CTGTGTTG CT CT CC CGGG CAGAG CTG CGT CTG CTGAGG CT C 74 0 
AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 550 

Illllllllll llllllll llllllll MMIMIMMIMM IIIIIIMII 

AAGTTAAAAGTCGAGC^GCATGTGGAGCTGTACC^GAAATAC^GC^CAATTCCTGGCGA 8 00 
TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 610 

MMMMIMMMMIIMM Mill MM lllllllll MM lllllllll 

TACCTCAGCAACCGGCTGCTGGCGCCCAGCAACTCGCCGGAGTGGTTGTCTTTTGATGTC 860 
ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 67 0 

IIIIIIMMIIIIIIIMII Ml MM III II II Mill llllllll II 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 92 0 
GCCC^CTGTTCCTGTGAC^GOWVGATAAC^CACTC^CGTGGAAATTAACGGGTTC^AT 73 0 

llllllll 1 1 1 1 1 1 1 1 1 1 Mill i I I > 1 1 1 1 1 1 II Mill II IMMIIIM I 

GCCCACTGCTCCTGTGACAGCAAAGATAAC 98 0 

TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 7 9 0 

I llllllll IMMMIMI:' Mill II II 1 1 II M 1 1 1 1 MMMM Mi 

ACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 104 0 
ATGGCOVCCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 8 5 0 

M II I II II M II M M M M M I II M Mill MMMMMMMMM 

ATGGCCACCCCGCTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGA 1094 



Qy 851 GATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATACCAAC 910 

llllllll llllll 



Db 



1095 



GCCCTGGACACCAAC 



1109 



Qy 


911 


TACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGG 


970 


Db 


1110 


IIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIII! 1! lllllllllll 

TACTGCTTCAGCTCCACGGAGAAGAACTGGTGCGTGCGGCAGCTGTATATTGACTTCCGC 


1169 


Qy 


971 


AAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTG 

llllllll IIIIIIIIIMIII II II 1 II 1 1 1 1 1 II II 1 II 1 1 M 1 lllllllll 

AAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTG 


1030 


Db 


1170 


1229 


Qy 


1031 


GGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTAC 

Illllllllllllllll llllllll Mill 1 1 ^ : 1 ! Illlllllllll' IIIMI 
GGGCCCTGTCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTAC 


1090 


Db 


1230 


1289 


Qy 


1091 


AACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCA 


1150 


Db 


1290 


llllllll lllllllllll II III MM II! 1 1 : 1 II Ml Ml III 1 1 II 1 II 1 1 II 

AACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCA 


1349 


Qy 


1151 


CTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATC 


1210 


Db 


1350 


M 1 1 1 M IM II 1 1 1 M M III 1 III M M 1 II 1 M M MM III 1 1 Ml II 1 II M 1 II 

CTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATC 


1409 


Ov 
vy 


-L -L -L 


nTnrnTTrnTnrAAnTnrAnnTnAnnrrnc^ 

Mill llllllll M M 1 II 1 IM M M 1 II II II 1 1 II Ml 1 1 1 1 

GTGCGCTCCTGCAAATGCAGCTGAGGCCCCGCCCCGCCCCGCCCCACCCCGGCAGGCCGG 


1270 


Db 


1410 


1469 


Qy 


1271 


GCCCGGCCCCACCCCCGCCCGC CTCACCGGGGCTGTATTTAAGGACA- TCGTG 

MM III III 1 Mill 1 1 lllllllllll MUM MM 

CCCCGCCCCACCCCAGCCCCGGTGTCTTGCCCTTGGGGGCTGTATTTAAGGACACCCGTG 


1322 


Db 


1470 


1529 


Qy 


1323 


CCCCAAGGCCACTTGGGATCGATTAAA 134 9 

IIIMMIMM MM 1 1 M 

CCCCAAGCCCACCTGGGGCCCCATTAA 1556 




Db 


1530 





RESULT 12 




AAQ13392 




ID 


AAQ13392 standard; DNA; 1821 BP. 


XX 






AC 


AAQ13392; 




XX 






DT 


20-NOV-1991 


(first entry) 


XX 






DE 


Human pro-TGF 


-beta 1 gene. 


XX 






KW 


Osteogenetic; 


tumoricidal; ss. 


XX 






OS 


Homo sapiens. 




XX 






FH 


Key 


Location/Qual if iers 


FT 


CDS 


512 . . 1684 


FT 




/*tag= a 


FT 


sig_peptide 


512 . . 598 


FT 




/*tag= b 


FT 


misc_RNA 


599 . . 1684 


FT 




/*tag= c 


FT 




/note= "pro-TGF-beta 1 



FT mat_peptide 1346.. 1684 

FT /*tag= e 

FT /note- "TGF-beta 1" 

XX 

PN JP03180192-A. 
XX 

PD 06-AUG-1991. 
XX 

PF 07-DEC-1989; 8 9 JP - 03 18243 . 
XX 

PR 07-DEC-1989; 8 9 JP- 03 18243 . 
XX 

PA (KIRI ) KIRIN BREWERY KK. 
XX 

DR WPI; 1991-271579/37. 
DR P-PSDB; AAR13813. 
XX 

PT Human pro-TGF-beta 1 prodn., for osteo-genet ic activity - by 
PT preparing DNA chain contg. base sequence coding for human 
PT pre :pro-TGF-beta 1, forming expression vector etc. 
XX 

PS Claim 1; Fig 1; 16pp; Japanese. 
XX 

CC The DNA sequence encodes human prepro-TGF-beta 1 which can be 
CC produced by recombinant methods, it has osteogenetic and 
CC tumoricidal activity. 
XX 

SQ Sequence 1821 BP; 326 A; 679 C; 508 G; 308 T; 0 other; 

Query Match 71.4%; Score 971.4; DB 12; Length 1821; 

Best Local Similarity 85.0%; Pred. No. 3.9e-187; 

Matches 1135; Conservative 0; Mismatches 141; Indels 59; Gaps 2; 

Qy 6 C CGAGATGG CG CCTTCGGGG CTG CGG CTCTTG C CG CTG CTGCTG CCG CTG CTGTGGCTG C 65 

II III I I I I II lllllllllll lllllllllllll MIIIIIIIIIMI I 
Db 5 07 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 56 6 

Qy 66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

I llllllllllllllll lllllllll Mill MIIIIIIIIIMI llllllllll 

Db 567 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 626 

Qy 12 6 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 lllllllllll II I M II M M I II I 

Db 627 AGCTGGTGAAGCGGAAGCGCATCGAGGCC^TCCGCGGCCAGATCCTGTCCAAGCTGCGGC 686 

Qy 186 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I M 1 1 1 1 1 ; 1 M ' I ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 i 1 1 1 1 1 1 1 Mill II II I 

Db 687 TGGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 74 6 

Qy 24 6 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II 1 1 1 1 ! 1 1 1 MIIIIIIIIIMI 1 1 i 1 1 i 1 1 MM llllllllllllllll 

Db 74 7 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 8 06 

Qy 3 06 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

I Mill 1 1 1 M ' 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 : 1 1 1 1 1 M M I ) 1 1 1 1 1 I MM 

Db 8 07 CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 8 66 



Qy 


366 


Db 


867 


Qy 


426 


Db 


927 


Qy 


486 


Db 


987 


Qy 


546 


Db 


1047 


Qy 


606 


Db 


1107 


Qy 


666 


Db 


1167 


Qy 


726 


Db 


1227 


Qy 


786 


Db 


1287 


Qy 


846 


Db 


1346 


Qy 


906 


Db 


1356 


Qy 


966 


Db 


1416 


Qy 


1026 


Db 


1476 


Qy 


1086 


Db 


1536 


Qy 


1146 


Db 


1596 


Qy 


1206 



AAATCTATGATAAATTC^AGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 5 

IIIIIIIIII II IIIIII I I IIIIII MMMM I MMMM II I 

AAATCTATGACAAGTTCT^GCAGAGTACAC^CAGCATATATATGTTCTTCAACACATCAG 926 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 5 

IIIMII IIIIIIII II Mill II MMMM MMMMMMM IIIMII 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 986 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

[ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 IIIIII 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 104 6 

GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 605 

MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill Mill II IIIIII I II MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGAGTCGCCAGAGTGGTTATCTTTTG 1106 

ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III III III II II Mill IIIMII 

ATGT CACCGGAGTTGTG CGG CAGTGGTTGAG CCGTGG AGGGGAAATTGAGGG CTTTCG C C 1166 

TC^GTGCCCACTGTTCCTGTGACAGaUU^GATAAOVCACTCCACGTGGAAATTAACGGGT 725 

I II IIIIIIII Illllllllllll IMIIMMM II Mill II IIIMII 

TTAGCGCC(^CTGCTCCTGTGACAGC^GGGATAAC^CACTGC7UVGTGGAC^T(^CGGGT 1226 

TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 785 

III I I IIIIIIII I . I ' ; . 1 1 1 , 1 1 1 1 1 1 : 1 1 1 I i 1 1 1 ! I ! 1 1 1 1 1 1 IIIMII 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1286 

TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I MMMMMMMMMM! MMMMMMM Mill 1 1 1 II I M 1 1 II I II I 1 1 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 134 5 



II 



CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 965 

IIIMII IIIIIIMMIIIIIIMIIIIIIMIIIIIIIIIIMIII IIIIIIIIII 

CCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACT 1415 
TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCT 1025 

MM IIIIIIII MMMM II II Ml III MM 

TCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 14 75 
GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 108 5 

MM IIIIIIII MMMM MMMM Mill MMMMMMMMMM I 

GCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCC 153 5 
TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 114 5 

1 1 1 1 M 1 1 1 II 1 1 MMMM 1 1 : 1 1 ! I ! 1 1 1 1 1 1 1 1 1 : 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 

TGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 1595 
AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 1205 

MM I II M 1 1 1 1 II 1 1 1 M M 1 1 II 1 1 II I II II 1 1 II I II II 1 1 1 II II II 1 1 II II 

AGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 1655 
12 06 TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCC 1265 



Illlllllll i 1 1 1 1 1 1 1 1 ! I III I li : I IIIMMIIIII II I II I 

Db 165 6 TGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAG 1715 
Qy 1266 GGCAGGCCCGGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA 1317 

I I Mill MM MM III II II II II II Ml I II II II III 

Db 1716 GCCCGGCCCCACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACA 177 5 

Qy 1318 TCGTGCCCCAAGCCC 1332 

Illlllll I II 
Db 1776 CCGTGCCCCCAAGCC 17 90 



RESULT 13 
AAN81084 

ID AAN81084 standard; cDNA; 1560 BP. 
XX 

AC AAN81084; 
XX 

DT 25-MAR-2003 (updated) 

DT 09-OCT-1990 (first entry) 

XX 

DE Coding sequence of simian transforming growth factor-beta 1. 
XX 

KW Transforming growth factor-beta 1; tumour treatment; ss cDNA. 
XX 

OS Cercopithecus aethiops. 
XX 

FH Key Location/Qualifiers 

FT CDS 261.. 1433 

FT /*tag= a 

FT sig_peptide 282.. 323 

FT " /*tag= b 

FT mat_peptide 1095.. 1433 

FT /*tag= c 

XX 

PN EP293785-A. 
XX 

PD 07-DEC-1988. 
XX 

PF 27-MAY-1988; 88EP-0108528 . 
XX 

PR 29-MAY-1987; 87US - 0055662 . 
PR 25-JAN-1988; 88US-0147842 . 
XX 

PA (ONCO ) ONCOGEN. 

PA (BRIM ) BRISTOL-MYERS CO. 

XX 

PI Purchio AG, Gentry L, Twardzik D; 
XX 

DR WPI; 1988-347488/49. 
DR P-PSDB; AAP80647. 
XX 

PT Prodn. of simian transforming growth factor beta-1 - by culturing 
PT transfected eucaryotic cells, and new precursor proteins, useful for 
PT treating tumours. 
XX 

PS Disclosure; Page ?; pp; English. 



XX 

CC The cDNA is prepd. from African green monkey cell line BSO40 and is 

CC expressed in eukaryotic cells in plasmid pSV2 . There is 100% homology 

CC between mature simian and human TGF-beta 1. The plasmid also contains 

CC the SV40 promoter and a selection marker, esp . DHFR . 

CC (Updated on 25-MAR-2003 to correct PA field.) 

CC (Updated on 25-MAR-2003 to correct PI field.) 

XX 

SQ Sequence 1560 BP; 301 A; 547 C; 445 G; 267 T; 0 other; 

Query Match 71.3%; Score 970.8; DB 9; Length 1560; 

Best Local Similarity 85.0%; Pred. No. 5.1e-187; 

Matches 1146; Conservative 0; Mismatches 142; Indels 60; Gaps 3; 
Qy 11 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 70 

III MM M I llllll III I III I MM III II MUM 1 1 Ml III II III 

Db 2 61 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 32 0 

Qy 71 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 13 0 

II MM I II MM III 1 1. 1 Mill I II III Mill II MUM MUM 

Db 321 CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATCGAGCTG 38 0 

Qy 131 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 190 

llllll IMMMMII MM III llllll M I MM III III Mill III 

Db 381 GTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 44 0 

Qy 191 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 250 

1 1 1 1 1 1 1 1 ' 1 1 1 ! 1 1 1 1 1 MM III Ml II III Mill Mill II II II II 

Db 441 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 5 00 

Qy 251 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 310 

III MM I MM II I III Ml I I II III II MM II 1 1 1 1 1 1 II II I 1 1 1 1 1 

Db 501 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAACCGGAG 560 

Qy 311 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 3 70 

II II 1 1 II 1 1 II 1 1 1 1 1 II II I II II 1 1 1 1 II 1 1 M 1 1 II II I II I MM Mill 

Db 561 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 62 0 

Qy 371 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCT^ 43 0 

Mill II llllll I I I IMIII Mill Ml I MM MM II IMIII 

Db 621 TATGACAAGTTCAAGCAGAGCACAC^ 68 0 

Qy 431 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 90 

II Mill II II Mill 1 1 1 1 1! i 1 1 M 1 1 1 1 1 1 1 1 1 MILIUM 

Db 681 CGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 74 0 

Qy 4 91 AAG TTAAAAGTGGAG CAG CA CGTGGAGCTATAC CAGAAATA CAG CAATGATT C CTGGCG C 550 

llllll Mill! I MM : 1 1 1 1 1 1 1 1 ! 1 1 1 : 1 1 ! IMIMIIM 

Db 741 AAGTTAAAAGTCGAGC^GC^TGTGGAGCTGTACC^GAAATACAGCAACAATTCCTGGCGA 8 00 

Qy 551 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 610 

MIIIIIMMMIIIIIIMM Mill MM MUM I MM IMIII 

Db 8 01 TACCTCAGCAACCGGCTGCTGGCGCCCAGCAACTCGCCGGAGTGGTTGTCTTTTGATGTC 860 

Qy 611 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 67 0 

IMIMIIM MM II III II III MM III M II MM II MM II II 

Db 8 61 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 92 0 



Qy 671 GCC(^CTGTTCCTGTGACAGOWVGATAAC^(^CTCa\CGTGGAAATTAACGGGTTCAAT 73 0 

MINIM MMIIIMMIIIMIMMIMM 1 1 Mill II lllllllll I 

Db 921 GCCCACTGCTCCTGTGACAGCAAAGATAACACACTGCAAGTGGACATCAACGGGTTCACT 98 0 

Qy 731 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 790 

I MM llllllllllllll INN MIMMIMI II MINIM III 

Db 981 ACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 104 0 

Qy 791 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 85 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1041 ATGGCCACCCCGCTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGAG 10 95 

Qy 851 GATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATACCAAC 910 

Ml MUM 

Db 10 96 CCCTGGACACCAAC 1109 

Qy 911 TACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGG 97 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 M MIMMIMI 

Db 1110 TACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTATATTGACTTCCGC 1169 

Qy 971 AAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTG 103 0 

111111,1 llllllllllllll II II I M M IM II M M Ml II I lllllllll 

Db 1170 AAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGGCTG 1229 

Qy 1031 GGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTAC 1090 

1 1 1 1 1 1 M 1 1 M II I M IMIMII Mill M II I III II Ml I II I II I MUM 

Db 123 0 GGGCCCTGTCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTGCTGGCCCTGTAC 12 8 9 

Qy 1091 AACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCA 1150 

IMIMII MIMMIMI 1 1 1 M 1 1 M Ml 1 1 M I M 1 1 1 1 M < 1 1 II I M II 1 1 II 

Db 12 9 0 AACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCA 134 9 

Qy 1151 CTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATC 1210 

1 1 M II 1 1 Ml 1 1 1 1 1 1 M 1 1! II 1 1 1 1 1 1 II MIM II I II Ml I II I II I M 1 1 M I 

Db 13 50 GTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATC 14 09 

Qy 1211 GTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCGCACCCGGCAG 1270 

Mill Mill M II I M 1 1 1 M MM I III 1 1 M I lllllllll 

Db 1410 GTGCGCTCCTGAAAATGCAGCTGAGGCCCCGCCCGGCCCCGCCCCACCCCGGCAGGCCCG 14 6 9 
Qy 12 71 GCCCGGCGCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA - TCGT 1321 

MM MIMIIMI III II I Ml 1 1 1 1 II I II I II I II III 

Db 14 7 0 GCCCCGCCCCACCCCACCCCCGGTGTCTTGCCCTTGGGGGCTGTATTTAAGGACACCCGT 152 9 

Qy 1322 GCCCCAAGCCCACTTGGGATCGATTAAA 134 9 

II M M I M M II MM I I 1 1 

Db 153 0 GCCCCAAGCCCACCTGGGGCCCCATTAA 1557 



RESULT 14 
AAQ03508 

ID AAQ03508 standard; DNA; 1560 BP. 
XX 

AC AAQ03508; 
XX 

DT 25-MAR-2003 (updated) 



DT 09-JAN-2003 (updated) 
DT 14-AUG-1990 (first entry) 
XX 

DE Simian Transforming growth factor - Betal. 
XX 

KW HIV; AIDS; SIV; vaccine; AZT; CD 4 ; cytokines; growth 
KW factors; ds . 
XX 

OS Cebus apella 
XX 

FH Key 
FT CDS 
FT 

FT mat_peptide 
FT 
XX 

PN EP356935-A. 
XX 

PD 07-MAR-1990. 
XX 

PF 25-AUG-1989; 
XX 

PR 25-AUG-1988; 
XX 

PA (ONCO ) ONCOGEN LP. 
XX 

PI Brankovan V, Lioubin M, Purchio A; 
XX 

DR WPI; 1990-068723/10. 
DR P-PSDB; AAR05663 . 
XX 

PT Compsns . contg. transforming growth factor beta - 

PT used for inhibitions of HIV infection and replication in vivo. 

XX 

PS Disclosure; Fig 1; 20pp; English. 
XX 

CC TGF-beta may be used in vivo to prevent formation of synctia and 

CC inhibit HIV infection. TGF may also be used with other HIV treatments 

CC (AZT, soluble CD4 etc.). 

CC (Updated on 09-JAN-2003 to add missing OS field.) 
CC (Updated on 25-MAR-2003 to correct PA field.) 
XX 

SQ Sequence 1560 BP; 301 A; 547 C; 445 G; 267 T; 0 other; 

Query Match 71.3%; Score 970.8; DB 11; Length 1560; 

Best Local Similarity 85.0%; Pred. No. 5.1e-187; 

Matches 1146; Conservative 0; Mismatches 142; Indels 60; Gaps 3; 

Qy 11 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 70 

III I I I I II MINIMI! lllllllllllll lllillllllllll II III 
Db 261 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 320 

Qy 71 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 13 0 

Illllllll lllllllllllll Mill MMMMMMI llllllll llllll 

Db 321 CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATCGAGCTG 38 0 

Qy 131 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 190 



Location/Qualifiers 
267. . 1437 
/*tag= a 
1103 . . 1437 
/*tag= b 



89EP-0115719 . 
88US-0236698 . 



IIIIIIIIIIIIIMIIIIII MM Illllllllll Illllllllll Mill III 

Db 381 GTG AAG CGGAAG CG CAT CG AG A C CAT C CG CGG C CAG AT C CTGT CCAAG CTG CGG CT CG C C 44 0 

Qy 191 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 2 50 

[ i 1 1 1 1 1 1 1 1 1 1 1 1 i II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 Mill II II II II 

Db 441 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 5 00 

Qy 251 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 310 

llllllll 1 1 1 . 1 1 1 1 1 ' 1 1 1 1 llllllll MM II Illllllllll II III 

Db 501 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAACCGGAG 560 

Qy 311 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 370 

II 1 1 M 1 1 1 1 1 1 M I ;! M 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 M I I I MM Mill 

Db 561 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 62 0 

Qy 371 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 43 0 

Mill II MMII I I I MMII Ml Mill I llllllll II llllll 

Db 621 TATGA CAAGTT CAAG CAG AG CA CA CACAG CATATATATGTT CTT CAA CACAT CAGAG CT C 68 0 

Qy 431 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 90 

II Mill II II llllllll llllllll IIIIIMIIIMM IIMMMIMI 

Db 681 CGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 74 0 

Qy 4 91 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 550 

Illllllllll I'M llllllll 1 1 1 I ! 1 1 1 1 1 1 1 1 ; 1 1 lllllllill 

Db 741 AAGTTAAAAGTCGAGCAGCATGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 8 00 

Qy 551 TACCTCAGCAAC CGG CTG CTGGCCCCCAGTGACTCACCGGAGTGG CTGT CCTTTGATGTC 610 

MMMMMIMMMMIMI Mill MM IMIIIMI MM Llllll! 

Db 8 01 TACCTCAGCAACCGGCTGCTGGCGCCCAGCAACTCGCCGGAGTGGTTGTCTTTTGATGTC 8 60 

Qy 611 AC CGGAGTTGTG CGG CAGTGG CTGAC CCG CAGAGAGG CTATAGAGGGTTTTCG CCT CAGT 670 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I III MM III M II Mill llllllll II 

Db 861 AC CGGAGTTGTG CGG CAGTGGTTGAG CCG CGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 92 0 

Qy 671 GCC(^CTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTa^AT 73 0 

llllllll 1 1 1 1 Ml I II I II 1 1 1 1 1 1 Ml 1 1 ! II Mill II lllllllill I 

Db 921 GCC(^CTGCTCCTGTGAC^GCAAAGATAACACACTGCAAGTGGACATCAACGGGTTCACT 98 0 

Qy 731 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 7 90 

I llllllll 111:111111 Mill IMIIMIIII.il llllllll III 

Db 981 ACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 104 0 

Qy 7 91 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 8 50 

1 1 E II 1 i 1 1 1 1 1 1 1 1 j i I ! : 1 1 ! II Mill I M I II I II M M 1 1 1 1 1 1 

Db 1041 ATGGCCACCCCGCTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGAG 1095 

Qy 851 GATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATACCAAC 910 

II I I I II I I II II 

Db 10 96 CCCTGGACACCAAC 110 9 

Qy 911 TACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGG 970 

III IIIIMII Mill MllllinillllllilllMM II IMIIIMI 

Db 1110 TACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTATATTGACTTCCGC 1169 

Qy 971 AAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTG 103 0 

llllllll Mill IIIMM II II II I M 1 1 1 M II I II 1 1 1 1 IMIIIMI 



Db 



117 0 AAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTG 1229 



Qy 1031 GGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTAC 1090 

Illllllllllllllll llllllll Mill llllllllllllllllllll mill 
Db 123 0 GGGCCCTGTCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTAC 128 9 

Qy 1091 AACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCA 1150 

llllllll I I I I I II I I II I I I I I I I II I I II I I I I M I I I I I I I I I II I I I I I I I I I 
Db 12 9 0 AACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCA 134 9 

Qy 1151 CTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATC 1210 

I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 13 50 CTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATC 14 0 9 

Qy 1211 GTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAG 127 0 

Mill Mill II i I I I I I : I I I I I I I I I I I I I I I ! I II I II I III 

Db 1410 GTGCGCTCCTGAAAATGCAGCTGAGGCCCCGCCCCGCCCCGCCCCACCCCGGCAGGCCCG 1469 

Qy 1271 GCCCGGCCCCAGCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA - TCGT 1321 

MM Mill II II I III II I II I II I II Ml II III II I III 

Db 147 0 GCCCCGCCCCACCCCACCGCCGCTGTCTTGCCCTTGGGGGCTGTATTTAAGGACACCCGT 152 9 

Qy 132 2 GCCCCAAGCCCACTTGGGATCGATTAAA 134 9 

! 1 1 i 1 1 1 1 1 1 M I MM I I M 

Db 153 0 GCCCCAAGCCCACCTGGGGCCCCATTAA 1557 



RESULT 15 
AAT05876 

ID AAT05876 Standard; CDNA; 2745 BP. 
XX 

AC AAT05876; 
XX 

DT 25-JUN-1996 (first entry) 
XX 

DE cDNA encoding transforming growth factor-beta 1. 
XX 

KW macrophage inducible nitric oxide synthase; iNOS; constitutive NOS; 

KW interleukin-l-beta; transforming growth factor-beta; TGF-beta; I LI -beta; 

KW nitric oxide production; hypotension; inflammation; septic shock; 

KW treatment; ds . 

XX 

OS Mammalian sp. 
XX 

FH Key Location/Qualifiers 
FT CDS 842 . .2017 

FT /*tag= a 

FT /product = transforming growth factor-beta 1 

XX 

PN W09526745-A1 . 
XX 

PD 12-OCT-1995. 
XX 

PF 05-APR-1994; 94WO-US03705 . 
XX 

PR 05-APR-1994; 94WO-US03705 . 
XX 



PA (HARD ) HARVARD COLLEGE. 
XX 

PI Lee M, Perrella MA; 
XX 

DR WPI; 1995-358443/46. 

DR P-PSDB; AAR83054. 
XX 

PT Treatment of hypotension, esp . in septic shock - by administering 

PT transforming growth factor-beta e.g. to inhibit inducible nitric 

PT oxide synthase gene transcription 
XX 

PS Disclosure; Fig 15; 52pp ; English. 
XX 

CC The cDNA encodes transforming growth factor-beta 1 (TGF-beta 1) which 

CC has been found to inhibit inducible nitric oxide synthase (iNOS) gene 

CC transcription, esp. in interleukin- 1-beta (ILl-beta) stimulated rat 

CC smooth muscle cells, and at a dose which does not inhibit constitutive 

CC NOS . TGF-beta 1 or 2 (AAR83055) or their active fragments, can be used 

CC in the treatment of hypotension, such as that associated with severe 

CC inflammation or septic shock. 
XX 

SQ Sequence 2745 BP; 527 A; 938 C; 801 G; 479 T; 0 other; 

Query Match 71.2%; Score 969; DB 16; Length 2745; 

Best Local Similarity 84.7%; Pred. No. 1.3e-186; 

Matches 1148; Conservative 0; Mismatches 145; Indels 62; Gaps 3; 
Qy 6 C CG AG ATGG CG C CTT CGGGG CTG CGG CTCTTG CCG CTG CTG CTG C CG CTG CTGTGG CTG C 65 

II Ml MM II III 1 1 II 1 1 II II MUM Mill II II llllll Ml I I 

Db 837 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 

Qy 66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCC^CCTGCAAGACCATCGACATGG 12 5 

I II I Mill llllll II I II III Ml Mill II MM III I II 

Db 8 97 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 956 

Qy 12 6 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 

1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! MMMMMI 1 1 1 1 1 i 1 1 1 1 1 MM 

Db 957 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 1016 

Qy 186 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Mill II M I 
Db 1017 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 

Qy 24 6 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II Ml 1 1 M I II 1 1 1 M II 1 1 1 1 1 1 1 1 MM 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 

Db 1077 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 1136 

Qy 3 06 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 3 65 

I Mill 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 M 1 1 1 M 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I MM 

Db 1137 CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 

Qy 366 AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 5 

MINIMI II MUM I I llllll IMMMI I MMMM II I 

Db 1197 AAATCTATGACAAGTTCAAGCAGAGTACACAC^ 1256 

Qy 4 26 AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCT - - 4 83 

I III II I IMMMI II Mill II IMMMI Ml II Mill MM Mill 



Db 


1257 


Qy 


484 


Db 


1317 


Qy 


543 


Db 


1377 


Qy 


603 


Db 


1437 


Qy 


663 


Db 


1497 


Qy 


723 


Db 


1557 


Qy 


783 


Db 


1617 


Qy 


843 


Db 


1677 


Qy 


903 


Db 


1686 


Qy 


963 


Db 


1746 


Qy 


1023 


Db 


1806 


Qy 


1083 


Db 


1866 


Qy 


1143 


nh 


± Z/ ^ \J 


Qy 


1203 


Db 


1986 


Qy 


1263 


Db 


2046 



12 57 AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 
- GAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATT 54 2 

! I ] 1 1 E ! I ! 1 1 1 1 1 1 . 1 1 1 1 1 1 1 1 1 1 1 I ! 1 1 1 1 1 1 I ! ' ! 1 1 1 1 i i 1 1 1 1 1 I III 

GGAGGCTCAAGTTAAAAGTGGAG(^GCACGTGGAGCTGTACCAGAAATA(^GCAACAATT 1376 
CCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCT 602 

Illllll [Ill 1 1 1 i 1 1 1 1 ! I Mill Mill II MUM I II I 

CCTGGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTT 1436 
TTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTC 662 

1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! Ml Ml Ml II II MIM MM 

TTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTC 14 96 
GCCTCAGTGCCCACTGTTCCTGTGACAGOWVGATAACAC^CTCCACGTGGAAATTAACG 722 

MM II IMMMI 1 1 1 1 1 1 1 1 1 1 1 1 1 MINIMI II MIM II MM 

GCCTTAGCGCCCACTGCTCCTGTGAC^GCAGGGATAACAC^CTGCAAGTGGACATCAACG 1556 
GGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCC 782 

MUM I I IMMMI 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 llllllllllllll MM 

GGTTCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCC 1616 
TGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCC 842 

MM 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill llllllllllllllll 

TGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCC 1676 

GAGCCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGG 902 

II Illllll 
GA GCCCTGG 1685 

ATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTG 962 

I IMMMI IMMMMMMIMMMMMMMMMIMIIMM Illllll 

ACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTG 174 5 
ACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATT 1022 

Illllll llllllll llllllllllllll M II 1 1 ! 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 I 

ACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACT 1805 
TCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGG 1082 

Illllll IMMMI IMMMI llllllll Mill 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 

TCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGG 18 6 5 
CTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGC 1142 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Illllllllll I M I 1 1 1 1 1 1 1 1 II Ml 1 1 1 ! M 1 1 1 1 1 M 

CCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGC 192 5 
TGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCA 12 02 

Illllll 1 1 1 1 1 1 M II I II II 1 1 1 II I II II 1 1 II I II II I II I II 1 1 1 1 1 1 II I II I 

TGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCA 198 5 

ACATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCA 1262 

lllllllllllll llllllllllllllllllll Illllllllll I II I II 

ACATGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGG 2 04 5 

CCCGGCAGGCCCGGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGG 1314 

I I I Mill MM MM III II II llllllllllllllll 

CAGGCCCGGCCCCACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGG 2105 



Qy 1315 ACATCGTGCCCCAAGCCCACTTGGGATCGATTAAA 134 9 

III llllllllllllllll I I I I I I II 
Db 2106 ACACCGTGCCCCAAGCCCACCTGGGGCCCCATTAA 214 0 



Search completed: October 27, 2003, 19:11:34 
Job time : 419.887 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: 



October 27, 2003, 18:35:27 ; Search time 404.529 Seconds 

(without alignments) 
9022.658 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-017-372E-38 
1361 

1 tggtaccgagatggcgcctt . 



. cgattaaagcggccgcgact 1361 



Scoring table: I DENTI TY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 1792395 seqs, 1340900451 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



3584790 



Database : 



Published 
1: /cgn2 
/cgn2__ 
/ cgn2_ 
/cgn2_ 
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/cgn2 
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/cgn2 
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17 



Applications_NA: * 
6/ptodata/2/pubpna/US07_PUBCOMB. seq: * 
6/ptodata/2/pubpna/PCT_NEW_PUB. seq: * 
6/ptodata/2 /pubpna / US 0 6_NEW_PUB .seq:* 
6/ptodata/2/pubpna/US06__PUBCOMB. seq: * 
6/ptodata/2/pubpna/US07_NEW_PUB. seq: * 
6 /p t oda t a / 2 /pubpna / PCTUS_PUBCOMB . s eq : 
6/ptodata/2/pubpna/US08_NEW_PUB. seq: * 
6/ptodata/2/pubpna/US08_PUBCOMB . seq : * 
6 /p t oda t a / 2 /pubpna /US 0 9A_PUBCOMB . s eq : 
6 /p t oda t a / 2 /pubpna /US 0 9 B_PUBC0MB . s eq 
6 /ptoda ta 1 2 /pubpna /US0 9C_PUBC0MB . seq 
_6 /pt oda t a / 2 /pubpna /US 0 9_NEW_PUB . s eq : 
6/ptodata/2/pubpna/US10A_PUBCOMB. seq 
_6 / p t oda t a / 2 /pubpna /US 1 0 B_PUBC0MB . s eq 
_6/ptodata/2/pubpna/US10_NEW_PUB. seq: 
_6/ptodata/2/pubpna/US60_NEW_PUB. seq: 
_6 /p t oda t a / 2 /pubpna /US 6 0_PUBCOMB .seq: 



Pred. No. is the number of results predicted by chance to have a 



score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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-833 


-381-585 


Sequence 


585, App 




40 


62 . 6 


4 


.6 


364 


12 


us 


-10 


-029 


-386-26476 


Sequence 


26476, A 




41 


62 . 6 


4 


.6 


544 


12 


us 


-10 


-029 


-386-12776 


Sequence 


12776, A 




42 


62.6 


4 


.6 


927 


12 


us 


-10 


-244 


-718-1 


Sequence 


1, Appli 




43 


57.4 


4 


.2 


486 


11 


us 


-09 


-918 


-995-25641 


Sequence 


25641, A 




44 


55.2 


4 


. 1 


594 


13 


us 


-10 


-027 


-632-141376 


Sequence 


141376, 




45 


50.4 


3 


.7 


658 


9 


us- 


09- 


765- 


527-254 


Sequence : 


254, App 



ALIGNMENTS 



RESULT 1 
US-10-087-268-1 

; Sequence 1, Application US/10087268 

; Publication No. US20030119010A1 

; GENERAL INFORMATION: 

; APPLICANT: Jonsonn, Julie Ruth 

; APPLICANT: Powell, Elizabeth Ellen 

TITLE OF INVENTION: Polypeptides and polynucleotides linked to a disease or 
condition 

; FILE REFERENCE : Fibrosis 

; CURRENT APPLICATION NUMBER : US/ 1 0/087 , 2 68 
; CURRENT FILING DATE: 2002-03-01 
; NUMBER OF SEQ ID NOS : 6 

SOFTWARE: Patent In version 3.1 
; SEQ ID NO 1 

LENGTH: 1821 

TYPE: DNA 

ORGANISM: Human 

FEATURE : 

NAME /KEY: 5 » UTR 

LOCATION: (1) . . (511) 

OTHER INFORMATION: 

NAME /KEY: CDS 

LOCATION: (512) .. (1684) 

OTHER INFORMATION: 

NAME/KEY: sig_peptide 

LOCATION: (512) . . (598) 

OTHER INFORMATION: 

NAME /KEY: 3 ' UTR 

LOCATION: ( 168 5 ) . . ( 182 1 ) 

OTHER INFORMATION: 
US-10-087-268-1 

Query Match 71.5%; Score 973; DB 14; Length 1821; 

Best Local Similarity 85.1%; Pred. No. 1.4e-254; 

Matches 1136; Conservative 0; Mismatches 140; Indels 59; Gaps 2; 
Qy 6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 



Db 



507 




Qy 



66 



TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 



Db 



567 




Qy 



126 



AG CTGGTG AAG CGG AAG CG CAT CGAGG CCATTCG CGG C CAG ATT CTGT CCAAG CTTCGG C 18 5 



Db 



627 




Qy 



186 



TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 



Db 



687 




Qy 



246 



CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 305 



Db 



747 




Qy 


306 


CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 


365 


Db 


807 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 


866 


Qy 


366 


AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 


425 


Db 


867 


II 1 1 II 1 II 1 II 1 II II 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 llllilll II 1 
AAATCTATGACAAGTTCAAGCAGAGTACACAC^GCATATATATGTTCTTCAACAC^TCAG 


926 


Qy 


426 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 


485 


Db 


927 


lllllll 1 II II 1 1 1 II 1 II II II llllilll II 1 1 1 II II II 1 II II II 1 II 
AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 


986 


Qy 


486 


GG CTCAAGTTAAAAGTGG AG CAGCACGTGGAG CTATAC CAGAAATACAG CAATGATT CCT 


545 


Db 


987 


1 II 1 1 II II II II II II 1 II 1 1 II 1 II 1 1 1 1 1 II II II 1 1 II II II 1 1 1 1 1 II II 1 1 
GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 


1046 


Qy 


546 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 

MM II II II II II II 1 II 1 1 1 1 1 II 1 1 1 1 1 1 II II 1 II 1 1 1 II 1 1 II 1 II 1 
GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 


605 


Db 


1047 


1106 


Qy 


606 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 


665 


Db 


1107 


II 1 II 1 1 1 1 II II II 1 II II 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 II II 1 1 
ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 


1166 


Qy 


666 


TC^GTGCCCACTGTTCCTGTGAC^GC^^GATAAC^CACTCCACGTGGAAATTAACGGGT 


725 


Db 


1167 


1 II 1 1 II II II lllllllllllll IIIIIIIIIII 1 1 1 1 1 1 1 II lllllll 
TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 


1226 


Qy 


726 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 


785 


Db 


1227 


III 1 1 1 1 1 II 1 II II II 1 1 1 II II 1 1 1 II 1 1 II Illlllllllllll II II 1 1 1 
TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 


1286 


Qy 


786 


TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 


845 


Db 


1287 


1 1 1 1 1 II II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II II II 1 II 1 1 1 II II 1 II 1 1 II 1 II 1 1 1 1 
TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 


1345 


Qy 


846 


CCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATA 


905 


Db 


1346 


llllilll 1 

GCCCTGGACA 


1355 


Qy 


906 


CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 


965 


Db 


1356 


lllllll 1 II 1 1 1 1 II 1 II II 1 II 1 1 1 1 1 1 1 II II 1 1 1 1 II II 1 1 II 1 1 1 1 1 1 II 1 1 1 
CCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACT 


1415 


Qy 


966 


TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACGCAAGGGCTACCATGCCAATTTCT 
i i i i iiiiiiii iiiiiiiiiiiiii ii ii i i i i i i i i i i i i i i i i i i i i i i i i 


1025 


Db 


1416 


MM II II II 1 1 1 1 1 1 1 1 1 II 1 II II II II 1 II II 1 1 II 1 1 1 M II 1 1 II MM 

TCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 


1475 


Qy 


1026 


GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 


1085 


Db 


1476 


MM MIMMI IIIIIIII IIIIIIII Mill IIIIIIIIIJIIi.il 1 1 ! 1 1 

GCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCGC 


1535 


Qy 


1086 


TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 


1145 


Db 


1536 


lllllllllllll IIIIIIIIIII IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIII 
TGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 


1595 



Qy 114 6 AGCC^CTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 12 0 5 

1 1 II MIMMIMMIIIIIMIMMIIIIIMMIMIIMIMIIIIMMIIM 

Db 1596 AGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 165 5 

Qy 1206 TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCC 1265 

Illlllllll llllllllllllllllllll IMIIIIIIIII II I II I 

Db 1656 TGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAG 1715 

Qy 1266 GGCAGGCCCGGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA 1317 

I I Mill INI MM III II II ; 1 1 1 1 1 1 1 1 1 , ' I , I Ml I 

Db 1716 GCCCGGCCCCACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACA 1775 

Qy 1318 TCGTGCCCCAAGCCC 1332 

M 1 1 1 1 1 ! I 1 1 

Db 1776 CCGTGCCCCCAAGCC 179 0 



RESULT 2 

US-10-037-270-220 

Sequence 220, Application US/10037270 
Publication No. US20030104529A1 
GENERAL INFORMATION: 

APPLICANT: Tang, Y. Tom 

APPLICANT: Liu, Chenghua 

APPLICANT: Asundi, Vinod 

APPLICANT: Zhang, Jie 

APPLICANT: Ren, Feiyan 

APPLICANT: Chen, Rui-hong 

APPLICANT: Zhao, Qing A. 

APPLICANT: Wehrman, Tom 

APPLICANT: Xue , Aidong J. 

APPLICANT: Yang, Yonghong 

APPLICANT: Wang, Jian-Rui 

APPLICANT: Zhou, Ping 

APPLICANT: Ma, Yunqing 

APPLICANT: Wang, Dunrui 

APPLICANT: Wang, Zhiwei 

APPLICANT: Tillinghast, John 

APPLICANT: Drmanac, Radoje T. 

TITLE OF INVENTION: No. US2 0 03 0 1 04 5 29Alel Nucleic Acids and 
TITLE OF INVENTION: Polypeptides 
FILE REFERENCE: 784CIP2B 

CURRENT APPLICATION NUMBER: US/10/ 037 , 270 
CURRENT FILING DATE: 2002-01-04 
PRIOR APPLICATION NUMBER : 09/552,317 
PRIOR FILING DATE: 2000-04-25 
PRIOR APPLICATION NUMBER : 09/488,725 
PRIOR FILING DATE: 2000-01-21 
NUMBER OF SEQ ID NOS : 1104 
SOFTWARE: pt_FL_genes Version 1.0 
SEQ ID NO 220 

LENGTH: 2 742 

TYPE : DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME /KEY: CDS 
LOCATION: (842) . . (2014) 



US-10-037-270-220 



Query Match 71.5%; Score 972.6/ DB 14; Length 2742; 

Best Local Similarity 84.9%; Pred . No. 2e-254; 

Matches 1149; Conservative 0; Mismatches 144; Indels 60; Gaps 3; 



Ov 

wy 




pppapatppppppttpppppptppppptpttppppptpptpptppppptpptptppptpp 




LJU 


A ^7 

O -D 1 


II III MM II IMMIMIM IIMIIIIIIIII 1 M 1 1 1 1 1 II 1 1 i 1 1 

CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 


o y d 


Ov 
wy 


66 


taptpptgapppptpppppppppppppppppaptptppapptppaapappatppapatpp 

1 MMIIIIMII IIMM ll'lll'l MM! MMMMMMM MMMMM 

TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGArTATCGACATGG 


J- £t ~J 


Db 


8 97 


956 


Ov 


126 


AG PTGGTGAAG CGGAAG PGPATPGAGGPPATTPPPGG PPAGATTPTGTP PAAP PTT PGP P 


185 


Db 


9S7 


MMMMIIIIIMMMMMIIIMMI MMMMIM MMIMMM MM 

AG CTGGTGAAG CGGAAG CG CATCG AGG CCATC CG CGG CCAGAT C CTGTC CAAG CTG CGG C 




Ov 


1 o u 


ttpppappppppppapppapppppapptpppppppppppppptppptpapppaptaptpp 

1 II M M M II 1 1 M M 1 II II II IMMMMIIIIMMMI Mill II II 1 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 


^4 1 D 


Db 


1017 


1 07^ 


Ov 


24 6 


PT CTTT A PAA P AGTA PP PG PP A PPGGPTAP P PPPGP AAAPTPT PP AA P PPG A P P P PP A O P 

1 1 1 MMMM MMMMMMM MMMM MM MMMMMMMM 

PPPTGTAPAAPAGPAPPPGPPAPPPPPTGPPPPPPPAGAPTnPAPAAPPPPAPPPPPAPP 


_3 U J 


Db 


1077 


X X _J u 


Ov 


3 06 


PAPAGGPPPAPTAPTAPPPPAAPPAPPTPAPPPPPPTPPTAATPPTPPAAAPPPPPAAPP 


JDJ 


Db 


X X _J ' 


1 MMI 1 1 M 1 M M 1 1 1 M M M M 1 1 M 1 M II M M M M M M 1 1 1 MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCC^CAACG 




Ov 

wy 


J u u 


AAATPTATPATAAATTPAAPPPPAPPPPPPAPAPPTTATATATPPTPTTPA APAPPTPPP 

MMMMM II IIIIII 1 1 IIMM MMMM 1 MMMM II 1 

AAATCTATGACAAGTTC1AAGCAGAGTA(LA.CA(^ 




Db 


1197 


x. ^> □ o 


Ov 
wy 


426 


AGPTPPPPGAAPPPPTPPPPPAAPPTPTATTPPTPTPTPPPPPAPAPPTPPPPPTPPTPA 

MMMI IMMMI M MMI M MMMM MMMIMMMI MMMI 

AGPTPPnAGAAPPPPTAPPTPAAPPPPTPTTPPTPTPPPPPPPAPAPPTPPPTPTPPTPA 


4 ft R 
1 o o 


Db 


12 57 


IJIO 


Ov 

wy 


4 R 

TOD 


PPPTPAAPTTAAA APTPPAPPAPPAPPTPPAPPTATAPPAPA A ATAPAPPA ATPATTPPT 


O ^ D 




1 7 

4. J 1 / 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 IMMMMIIMMM MMII 

PPPTPA APTTA A A APTPP APPAPPA PPTPP APPTPTA PPAPA A ATA PAPP A A P A ATTPTT 


1 ^ ~7 £ 
J. J / D 


Qy 


546 


GGCGCTACCTCIAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 

MM IMMMMMMMIMIMM Mill Mill II IIMM 1 II MM 

GGCGATACCTClAGCA&CCGGCTGCTGGC^rc 


605 


Db 


1377 


1436 


Qy 


606 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 

IIIIIIII MMMMMMIMM! IIMM Mill M MM! MMMI 

ATGT CAC CGGAGTTGTG CGG CAGTGGTTGAGCCGTGGAGGGGAAATTGAGGG CTTT CG C C 


665 


Db 


1437 


1496 


Qy 


666 


TCAGTGCCCIACTGTTCCTGTGACAGCAAAGATAAC^CIACTCCACGTG 


725 


Db 


1497 


1 M MMMM 1 11 1 1 M 1 1 ! 1 1 1 IMMIMIM II Mill II MMMI 

TTAGCGCCC^CTGCTCCTGTGACAGCAGGGATAACA 


1556 


Qy 


726 


TC1AATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 


785 


Db 


1557 


Ml 1 1 MMMM llllllllllllllllllll IMIMMIIMM MMMI 

TCACTACCGGCCGCCGAGGTGACCTGGCC1A.CCATTCATGGC7VTGAACCGGCCTTTCCTGC 


1616 



Qy 786 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI Mill II I i 1 1 1 1 1 1 1 1 1 ! 1 1 1 

Db 1617 TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 1675 

Qy 84 6 CCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATA 905 

llllllll I 

Db 1676 GCCCTGGACA 1685 

Qy 906 CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 965 

MINI M III III IIIMIIIIII Mill I II II I INI liM Ml MMMMM 

Db 1686 CCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACT 174 5 

Qy 966 TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCT 102 5 

MM MMMM MMMMMMM 1 1 1 1 MMMMMMMMMM MM 

Db 174 6 TCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 1805 

Qy 1026 GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 108 5 

MM MMIMI MMMM MIMMI MMI MIMMMMMMMMI I 

Db 18 06 GCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCC 18 65 

Qy 1086 TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 114 5 

MMMMMMI MMMMIM M I M M M M M M M M M M M M M M M I 

Db 1866 TGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 1925 

Qy 114 6 AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 12 05 

MM MMMMMMMMMMMMMMMMMMMMMMMMMMMI 

Db 1926 AGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 198 5 



Qy 12 06 TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCC 12 65 

MMMMM MMMMIMMMMMI MMMMMM MMI I 

Db 1986 TGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAG 2 04 5 
Qy 1266 GGCAGGCCCGGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA 1317 

I I MMI MM MM Ml M II M II II 1 1 1 1 1 1 1 M 1 1 II 

Db 2 04 6 GCCCGGCCCCACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACA 2105 



Qy 1318 - TCGTGCCCCAAGCCCACTTGGGATCGATTAAA 134 9 

MMMMMMI MM I I II 

Db 2106 CCCGTGCCCCAAGCCCACCTGGGGCGCCATTAA 213 8 



RESULT 3 
US-10-087-268-4 

; Sequence 4, Application US/10087268 
/ Publication No. US20030119010A1 
; GENERAL INFORMATION: 

APPLICANT: Jonsonn, Julie Ruth 
; APPLICANT: Powell, Elizabeth Ellen 

; TITLE OF INVENTION: Polypeptides and polynucleotides linked to a disease or 
condition 

; FILE REFERENCE: Fibrosis 

; CURRENT APPLICATION NUMBER: US/10/087 , 268 
; CURRENT FILING DATE: 2002-03-01 
; NUMBER OF SEQ ID NOS : 6 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 4 



LENGTH: 1821 
TYPE: DNA 
ORGANISM: Human 
FEATURE : 

NAME /KEY: 5 1 UTR 
LOCATION: (1) . . (511) 
OTHER INFORMATION: 
NAME /KEY: CDS 
LOCATION: (512) . . (1684) 
OTHER INFORMATION: 
NAME/KEY : s ig_pept ide 
LOCATION: (512) . . (598) 
OTHER INFORMATION: 
NAME/ KEY: 3 1 UTR 
LOCATION: (1685) . . (1821) 
OTHER INFORMATION: 
US-10-087-268-4 

Query Match 71.4%; Score 971.4; DB 14; Length 1821; 

Best Local Similarity 85.0%; Pred. No. 3.9e-254; 

Matches 1135; Conservative 0; Mismatches 141; Indels 59; Gaps 2; 

CCGAGATGGCG C CTTCGGGG CTG CGG CT CTTG C CG CTG CTGCTG C CG CTGCTGTGG CTGC 6 5 

II III I I I I II MINIM I. I I I I I 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 ! 1 1 I 

CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 5 66 
TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

I I INI III N MINIM Mill IMMMMIMII llllllllll 

TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 62 6 
AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 

ill lllllllllll IIIIIIIIIM MM 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 68 6 
TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill II II I 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 74 6 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I M IMMMI 1 1 1 1 1 1 1 1 1 IE 1 1 1 MMIMI MM 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 8 06 
CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

I Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 8 66 
AAATCTATGATAAATTCAAGGGCACCCCCCACA^ 425 

llllllllll II MINI I I MINI IMMMI I IMMMI II I 

AAATCTATGACAAGTTCAAGCAGAGTACACACAGCATA 92 6 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 5 

lllllll IMMMI II Mill II IMMMI MIMMMIMM I II 1 1 1 1 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 98 6 
GG CT CAAGTTAAAAGTGG AG CAG CA CGTGG AG CTATACCAGAAATACAG CAATG ATT C CT 54 5 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 i 1 1 1 1 [ 1 1 1 1 1 ! 1 1 M I M II 1 1 1 1 llllll 

GGCTC^GTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATA(^GCAACAATTCCT 104 6 



Qy 


6 


Db 


507 


wy 


D D 


Db 


567 


Qy 


126 


Db 


627 


Qy 


186 


Db 


687 


Qy 


246 


Db 


747 


Qy 


306 


Db 


807 


Qy 


366 


Db 


867 


Qy 


426 


Db 


927 


Qy 


486 


Db 


987 



Qy 


546 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 

III! IIMIIIIIMIIIIIIIIIIII Mill Mill II MUM 1 II MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 


605 


Db 


1047 


1106 


Qy 


606 


ATGT CACCGGAGTTGTG CGG CAGTGG CTGA C CCG CAGAGAGG CTATAG AGGGTTTT CG CC 

1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III Ml III II II Mill lllllll 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 


665 


Db 


1107 


1166 


Qy 


666 


TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 

1 1 1 IIIIIMI MIMMIIIIII MIIMIMM II Mill II lllllll 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACAGTGCAAGTGGACATCAACGGGT 


725 


Db 


1167 


1226 


Qy 


726 


TCAATTCTGGCGGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 

III 1 1 IIIIIMI 1 J 1 I 1 I 1 1 i 1 1 1 1 I 1 1 1 1 J 1 MIMIMMMM lllllll 
TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 


785 


Db 


1227 


1286 


Qy 


786 


TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCAGCTGCACAGCTCCCGGCACCGCCGAG 

1 MIMIIIIIIIMIIIMIIIIIIIIIIIII Mill IMIIIMIMIIIIIII 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 


845 


Db 


1287 


1345 


Qy 


846 


CCCTGGATACCAACAGGTACCCATACGAGGTGCCAGACTACGCATCTCTGGCCCTGGATA 

IIIIIMI 1 

GCCCTGGACA 


905 


Db 


1346 


1355 


Qy 
Db 


906 
1356 


CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 

lllllll IIIIMIIIIIMIIIIMIIIIIMIIIMIIIMMIII IMMIIMI 

CCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACT 


965 
1415 


Qy 


966 


TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCT 

MM IIIIIMI MIMMMMIM II II IIIIIIIMIIIIIMIIII MM 

TCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 


1025 


Db 


1416 


1475 


Qy 

Db 


1026 
1476 


GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 

MM IIIIIMI IIIIIMI IIIIIMI Mill MMMMMIMMIMM 1 

GCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCC 


1085 
1535 


Qy 


1086 


TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 


1145 


Db 


1536 


II 1 II 1 II 1 M II MIIMIMM M i 1 1 Ml 1 1 Ml III MM M 1 M M Ml 1 II 

TGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 


1595 


Qy 


1146 


AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 

MM IMIMMMIIIIIIIIIIIIIIIIIIIIIIMIIIIIMIMIMIIIIIIII 

AGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 


1205 


Db 


1596 


1655 


Qy 


1206 


TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCC 

1 11 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II i II 1 

1 1 II II II II I 1 II I II 1 II II II 1 II 1 M 1 1 1 1 1 II II 1 II II 1 M 1 

TGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAG 


1265 


Db 


1656 


1715 


Qy 


1266 


GGCAGGCCCGGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA 

1 1 Mill MM MM Ml II M II II 1 M M 1 1 1 1 1 1 1 II 1 

GCCCGGCCCCACCCCGCCCCGCCCCCGCTGCCTTGGCCATGGGGGCTGTATTTAAGGACA 


1317 


Db 


1716 


1775 


Qy 


1318 


TCGTGCCCCAAGCCC 1332 




Db 


1776 


IIIIIMI 1 M 

CCGTGCGCCCAAGCC 17 90 





RESULT 4 

US-09-948-002-28 

; Sequence 28, Application US/09948002 
; Publication No. US20030050265A1 
; GENERAL INFORMATION: 
; APPLICANT: Nicholas M . Dean 
APPLICANT: Susan F. Murray 
; TITLE OF INVENTION: ANTISENSE MODULATION OF TRANSFORMING GROWTH 
; TITLE OF INVENTION: FACTOR BETA EXPRESSION 
; FILE REFERENCE: ISPH-0607 

; CURRENT APPLICATION NUMBER : US/09/948 , 002 

; CURRENT FILING DATE: 2000-09-05 

; PRIOR APPLICATION NUMBER: 09/661,753 

; PRIOR FILING DATE: 2000-09-14 

; PRIOR APPLICATION NUMBER: 60/154,546 

; PRIOR FILING DATE: 1999-09-17 

; NUMBER OF SEQ ID NOS : 71 

; SEQ ID NO 28 

LENGTH: 274 5 

TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME/ KEY : CDS 
LOCATION: (842) . . . (2017) 
US-09-948-002-28 

Query Match 71.2%; Score 969; DB 11; Length 2745; 

Best Local Similarity 84.7%; Pred. No. 1.9e-253; 

Matches 1148; Conservative 0; Mismatches 145; Indels 62; Gaps 3; 
Qy 6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 



Db 




Qy 



66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 



Db 




Qy 



126 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 



Db 




Qy 



18 6 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 



Db 




Qy 



24 6 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 



Db 




Qy 



Db 




Qy 



3 66 AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 5 

llllllllll II MINI I I 1 1 1 1 1 1 MINIM I IMIMM M I 



Db 


1197 


Qy 


426 


Db 


1257 


Qy 


484 


Db 


1317 


Qy 


543 


Db 


1377 


Qy 


603 


Db 


1437 


Qy 


663 


Db 


1497 


Qy 


723 


Db 


1557 


Qy 


783 


Db 


1617 


Qy 


843 


Db 


1677 


Qy 


903 


Db 


1686 


Qy 


963 


Db 


1746 


Qy 


1023 


Db 


1806 


Qy 


1083 


uv 


1 Q £ (Z 


Qy 


1143 


Db 


1926 


Qy 


1203 


Db 


1986 



1197 AAATCTATGACAAGTTCAAGCAGAGTACACAC^ 1256 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCT- - 4 83 

Illllll IIIIIIII II Mill II IIIIIIII IIIIIIIIIIIIII Mill 
AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 

- GAGGCT CAAGTTAAAAGTGGAG CAG CACGTGGAG CTATACCAGAAATA CAG CAATGATT 542 

I MM I M M M M I M 1 1 1 ! M I ! 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 III 

GGAGG CTCAAGTTAAAAGTGG AG CAG CA CGTGG AG CTG TA CCAG AAATA CAG CAA CAATT 1376 
CCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCT 602 

Illllll lllllllllllllllllllllll Mill Mill II IMIM I II I 

CCTGGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTT 1436 
TTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTC 662 

MIIIIIIIIIIIIIIIIMIIIIMIII III III III II II Mill MM 

TTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTC 14 96 
GCCT(^GTGCC(^CTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACG 722 

MM II IIIIIIII IMIIIIIIIIM IMIIMMM M Mill II MM 

GCCTTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACG 1556 
GGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCC 782 

IMIM I I 1 1 1 1 1 1 E I MMMIIMIMMIMM MIMMMIMM MM 

GGTTCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCC 1616 
783 TGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCC 84 2 

MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 E 1 1 1 1 ! 1 1 1 Mill IIIIIIII III 

TGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCC 1676 

GAGCCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGG 902 

II Illllll 
GA GCCCTGG 1685 

ATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTG 962 

I MIMMI ! M M M M M Ml M 1 1 1 1 1 1 1 IM I M I ! 1 1 1 1 M 1 1 1 Illllll 

ACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTG 174 5 
ACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGC CAATT 1022 

Illllll IIIIIIII MMIMMIMM II II I J 1 1 1 E 1 1 1 1 f 1 1 1 1 1 1 1 1 E I 

ACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACT 18 05 
TCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGG 1082 

Illllll MIMMI MIMMI MIMMI Mill I M 1 1 1 1 1 1 1 1 1 M II I II 

TCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGG 1865 
CTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGC 1142 

I MMIMMIMM M M I II 1 1 II II 1 1 1 1 1 1 1 1 II I II 1 1 II II 1 1 1 

C C CTGTA CAACCAG CATAA C C CGGG CG C CTCGG CGGCG C CGTG CTG CGTG CCGCAGG CG C 192 5 
TGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCA 12 02 

Illllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 

TGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCA 1985 
ACATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCA 1262 

in iiiiiiiiiiiiiiiiiiii : ii I in ii i ii 

ACATGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGG 2 04 5 



Qy 1263 CCCGGCAGGCCCGGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGG 1314 

I I I Mill I I I I I I I I III II II llllllllllllllll 

Db 2 04 6 CAGGCCCGGCCCCACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGG 2105 

Qy 1315 ACATCGTGCCCCAAGCCCACTTGGGATCGATTAAA 134 9 

III llllllllllllllll Mill I II 
Db 2106 ACACCGTGCCCCAAGCCCACCTGGGGCCCCATTAA 214 0 



RESULT 5 
US-09-948-002-1 

Sequence 1, Application US/09948002 
Publication No. US20030050265A1 
GENERAL INFORMATION: 
APPLICANT: Nicholas M. Dean 
APPLICANT: Susan F. Murray 

TITLE OF INVENTION: ANT I SENSE MODULATION OF TRANSFORMING GROWTH 
TITLE OF INVENTION: FACTOR BETA EXPRESSION 
FILE REFERENCE: ISPH-0607 

CURRENT APPLICATION NUMBER: US/09/ 948 , 002 
CURRENT FILING DATE: 2000-09-05 
PRIOR APPLICATION NUMBER: 09/661,753 
PRIOR FILING DATE: 2000-09-14 
PRIOR APPLICATION NUMBER: 60/154,546 
PRIOR FILING DATE: 1999-09-17 
NUMBER OF SEQ ID NOS : 71 
SEQ ID NO 1 
LENGTH: 2094 
TYPE: DNA 

ORGANISM: Mus musculus 
FEATURE : 
NAME /KEY: CDS 
LOCATION: (868) . . . (2040) 
US-09-948-002-1 

Query Match 62.5%; Score 85 0.2; DB 11; Length 2 0 94; 

Best Local Similarity 80.9%; Pred . No. 3.7e-221; 

Matches 1040; Conservative 0; Mismatches 188; Indels 57; Gaps 2; 
Qy 6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 

II Ml MM llllllllllllll IIIIMI Mill 1 1 II I I MM I 

Db 8 63 CCCCCATGCCGCCCTCGGGGCTGCGGCTACTGCCGCTTCTGCTCCCACTCCCGTGGCTTC 922 

Qy 66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

Illllllllllll II MM Mill Mill 1 1 1 1 1 I ! 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 

Db 923 TAGTGCTGACGCCCGGGAGGCCAGCCGCGGGACTCTCCACCTGCAAGACCATCGACATGG 982 

Qy 126 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 185 

Illlllllll Illllllllllll Mill II llllllll llllllll II III 

Db 983 AGCTGGTGAAACGGAAGCG(^TCGAAGCCATCCGTGGC(^GATCCTGTC(^AACTAAGGC 104 2 

Qy 186 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I Mill Mill lllllllllll II I I j I I I I I I I I I I I I I I Mill II II I 
Db 1043 TCGCCAGTCCCCCAAGCCAGGGGGAGGTACCGCCCGGCCCGCTGCCCGAGGCGGTGCTCG 1102 



Qy 24 6 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 



Db 


1103 


Qy 


306 


Db 


1163 


Qy 


366 


Db 


1223 


Qy 


426 


Db 


1283 


Qy 


486 


Db 


1343 


Qy 


546 


Db 


1403 


Qy 


606 


Db 


1463 


Qy 


666 


Db 


1523 


Qy 


726 


Db 


1583 


Qy 


786 


Db 


1643 


Qy 


846 


Db 


1702 


Qy 


906 


Db 


1712 


Qy 


966 


Db 


1772 


Qy 


1026 


Db 


1832 


Qy 


1086 



II I IIIIIIII 1 1 1 1 1 1 1 1 II M 1 1 II II II II I III II Mill IMI 

CTTTGTACAACAGCACCCGCGACCGGGTGGCAGGCGAGAGCGCCGACCCAGAGCCGGAGC 1162 

CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

I II lllllllllll II I I llllllllllllllllllllllllll II MM 
CCGAAGCGGACTACTATGCTAAAGAGGTCACCGGCGTGCTAATGGTGGACCGCAACAACG 1222 

AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 425 

IIIIIIII III III I M I I MMI IIIIIIII I MMI Mill I 

CCATCTATGAGAAAACCAAAGACATCTCACACAGTATATATATGTTCTTCAATACGTCAG 1282 
AG CT C CGGG AAG CGGTG CCGGAACCTGTATTG CTCT CT CGGG CAGAG CTG CG CCTG CTGA 4 85 

I I IIIIIIII Mill Mill MMM M II MMIIIIM Ml II 

ACATTCGGGAAGCAGTGCCCGAACCCCCATTGCTGTCCCGTGCAGAGCTGCGCTTGCAGA 1342 
GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

I I II I II IIIIIIII II Mill II Mill MM Mill Mill! 

GATTAAAATCAAGTGTGGAGCAACATGTGGAACTCTACCAGAAATATAGCAACAATTCCT 14 02 
GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 605 

MM Mill I IMI IIIIII IIIIII III I II lllllllllll MM 

GGCGTTACCTTGGTAACCGGCTGCTGACCCCCACTGATACGCCTGAGTGGCTGTCTTTTG 1462 
ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

I Mill IIIIIIII lllllllllllll II MM I III MM MMI 

ACGTCACTGGAGTTGTACGGCAGTGGCTGAACCAAGGAGACGGAATACAGGGCTTTCGAT 1522 
TCAGTGCCCAGTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 725 

MM II Mill II 1 1 Ml II I MM 1 1 1 1 : MM IIIIIIII I MMM 

TCAGCGCTCACTGCTCTTGTGACAGCAAAGATAACAAACTCCACGTGGAAATCAACGGGA 1582 
TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 785 

III I II Mill Ml I IIIIII II I III I II I M M II 1 1 1 1 M I 

TCAGCCCCAAACGTCGGGGCGACCTGGGCACCATCCATGACATGAACCGGCCCTTCCTGC 164 2 
TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

II I II I II I M M 1 1 1 Mill I M I II I II I II 1 1 Ml III I ' IIIIIIII II 

TCCTCATGGCCACCCCCCTGGAAAGGGCCCAGCACCTGCACAGCTCACGGCACCGGAGA- 1701 



I I I I I I I I I I 



CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 965 

IIIIII 1 1 1 II I II II M II I II I MM III 1 1 1 lllllllllll MMIIIIM 

CCAACTATTGCTTCAGCTCCACAGAGAAGAACTGCTGTGTGCGGCAGCTGTACATTGACT 1771 

TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCT 102 5 

I lllllllllllll lllllllllll M II II Mill IIMIIMII MM 

TTAGGAAGGACCTGGGTTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 1831 



I Mill Mill MMI II IIIIIIII Mill I III MM M 1 1 1 1 1 ! II I 

GTCTGGGACCCTGCCCCTATATTTGGAGCCTGGACACACAGTACAGCAAGGTCCTTGCCC 
TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 

I IIIIIIII 1 1 1 1 1 1 1 1 1 1 1 II I IIIIII I 1 1 1 II Ml I II I III Ml M III 



Db 



18 92 TCTACAACCAACACAACCCGGGCGCTTCGGCGTCACCGTC 1951 



Qy 114 6 AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 12 05 

Illllllllllllll! lllllllllll lllllllllllllllllllll lllllllll 
Db 1952 AGCCACTGCCCATCGTCTACTACGTGGGTCGCAAGCCCAAGGTGGAGCAGTTGTCCAACA 2011 

Qy 12 06 TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCC 1265 

MM Mill II Illllllllllllll I MINI III III III Ml III 

Db 2 012 TGATTGTGCGCTCCTGCAAGTGCAGCTGAAGCCCCGCCCCGC CCCGCCCCTCCC 2065 

Qy 12 66 GGCAGGCCCGGCCCCACCCCCGCCC 12 90 

MM Ml IIIIM II lllllllll 

Db 2 066 GGCAGGCCCGGCCCCGCCCCCGCCC 2 090 



RESULT 6 

US-09-948-002-27 

Sequence 27 , Application US/09948002 
Publication No. US2 003 0050265A1 
GENERAL INFORMATION: 
APPLICANT: Nicholas M . Dean 
APPLICANT: Susan F. Murray 

TITLE OF INVENTION: ANTI SENSE MODULATION OF TRANSFORMING GROWTH 
TITLE OF INVENTION: FACTOR BETA EXPRESSION 
FILE REFERENCE: ISPH-0607 

CURRENT APPLICATION NUMBER : US/09/948 , 002 
CURRENT FILING DATE: 2000-09-05 
PRIOR APPLICATION NUMBER: 09/661,753 
PRIOR FILING DATE: 2000-09-14 
PRIOR APPLICATION NUMBER: 60/154,546 
PRIOR FILING DATE: 1999-09-17 
NUMBER OF SEQ ID NOS : 71 
SEQ ID NO 27 
LENGTH: 1585 
TYPE : DNA 

ORGANISM: Rattus norvegicus 
FEATURE : 
NAME /KEY : CDS 
LOCATION: (413) . . . (1585) 
US-09-948-002-27 

Query Match 61.0%; Score 830.6; DB 11; Length 1585; 

Best Local Similarity 81.3%; Pred. No. 7.4e-216; 

Matches 999; Conservative 0; Mismatches 179; Indels 51; Gaps 1; 
Qy 6 C CG AG ATGG CG C CTTCGGGG CTG CGG CT CTTG C CG CTG CTG CTG CCG CTG CTGTGG CTG C 65 

M III MM 1 1 11 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 Mill II II I IIIIM I 

Db 4 08 CCCCCATGCCGCCCTCGGGGCTGCGGCTGCTGCCGCTTCTGCTCCCACTCCCGTGGCTTC 4 67 

Qy 66 T AGTG CTG ACG C CTGG C CGG C CGG C CG C CGGA CTGT CCA C CTG CAAGA CCAT CGA CATGG 125 

1 1 1 1 E 1 1 1 1 1 1 1 1 II 1 1 1 1 Mill Mill 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 E 1 1 1 1 1 1 1 

Db 4 68 TAGTGCTGACGCCCGGGAGGCCAGCCGCGGGACTCTCCACCTGCAAGACCATCGA CATGG 527 

Qy 12 6 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 185 

IMMMMI 1 1 1 1 1 1 r 1 1 1 1 1 1 1 Mill II llllllll IIIIIIM M III 

Db 52 8 AGCTGGTGAAACGGAAGCGCATCGAAGCCATCCGTGGCCAGATCCTGTCCAAACTAAGGC 587 



Qy 18 6 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I Mill I MM II II Mill ! 1 1 1 1 1 1 1 1 1 1 Mill II II I 

Db 58 8 TCGCCAGTCCCCCGAGCCAGGGGGAGGTACCGCCGGGCCCGCTGCCCGAGGCGGTGCTCG 64 7 

Qy 24 6 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

M I llllllll M I I 'MM M II II II I II lllllllllllll 

Db 64 8 CTTTGTACAACAGCACCCGCGACCGGGTGGCAGGCGAGAGCGCTGACCCGGAGCCCGAGC 7 07 

Qy 306 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 3 65 

I IIIIIIIIIIIIIIIIMM II 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 ! 1 1 ! 1 1 II I II I 

Db 7 08 CCGAGGCGGACTACTACGCCAAAGAAGTCACCCGCGTGCTAATGGTGGACCGCAACAACG 767 

Qy 366 AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 5 

llllll III III I II I I Mill IMIIIII I Mill Mill I 

Db 768 CAATCTATGACAAAACCAAAGACATCACAC^ 82 7 

Qy 426 AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 85 

I I llllllll Mill Mill llllll II II IIIIIIMIIIIIIII II 
Db 82 8 ACATTCGGGAAGCAGTGCCAGAACCCCCATTGCTGTCCCGTGCAGAGCTGCGCCTGCAGA 887 

Qy 4 8 6 GG CTCAAGTTAAAAGTGGAG CAG CA CGTGGAG CTATAC CAGAAATA CAG CAATG ATTCCT 54 5 

I llllll II llllllll Mill II II lllllilllll Mill MMM 

Db 888 GATTCAAGTCAACTGTGGAGCAACACGTAGAACTCTACCAGAAATATAGCAACAATTCCT 94 7 

Qy 54 6 GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 605 

MM Mill I MIMIMMM llllll III I II IMIMIMM MM 

Db 94 8 GGCGTTACCTTGGTAACCGGCTGCTGACCCCCACTGATACGCCTGAGTGGCTGTCTTTTG 1007 

Qy 606 ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

I Mill MMMM MIMMIIMM II MM I III MM llllll 

Db 1008 ACGTCACTGGAGTTGTCGGGCAGTGGCTGAACCAAGGAGACGGAATACAGGGCTTTCGCT 1067 

Qy 666 TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 72 5 

MMM Mill II MM MMMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II Ml 

Db 1068 TCAGTGCTCACTGCTCTTGTGACAGCAAAGATAATGTACTCCACGTGGAAATCAATGGGA 112 7 

Qy 72 6 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 785 

Ml II MM Illlllllll MMM M I MMMM Illlllllll 

Db 112 8 TCAGTCCCAAACGTCGAGGTGACCTGGGCACCATCCATGACATGAACCGACCCTTCCTGC 1187 

Qy 786 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

MMM MMMM Mill Mill II 1 1 1 II 1 1 1 1 1 II 1 1 1 MMM II 

Db 118 8 TCCTCATGGCGACCCCCCTGGAAAGGGCTCAACACCTGCACAGCTCCAGGCACCGGAGA- 124 6 

Qy 84 6 CCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATA 905 

Illlllllll 

Db 124 7 GCCCTGGATA 1256 

Qy 906 CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGC^GCTCTACATTGACT 96 5 

Illlllllllllllllllllll MMM MMM II MMMM illlllllll 

Db 12 57 CC^CTACTGCTTCAGCTCCACAGAGAAGAACTGCTGTGTACGGCAGCTGTACATTGACT 1316 

Qy 966 TCCGGAAGGACCTGGGCTGGAAGTGGATT(^TGMCCCAAGGGCTACC^TGCCAATTTCT 1025 

I MINIMUM lllllilllll II II MM Mill, MMM MM 

Db 1317 TTAGGAAGGACCTGGGTTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCT 1376 

Qy 102 6 GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 108 5 



Db 



I lllllllllll llllllll llllllll Mill lllllllllllllllll II I 
13 77 GTCTGGGGCCCTGCCCCTACATTTGGAGCCTGGACACACAGTACAGCAAGGTCCTTGCCC 143 6 



Qy 


1086 


TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 


1145 






1 llllllll lllllllllll II II II 1 II 1 II 1 M 1 1 1 II IM 1 1 1 1 III 




Db 


1437 


TCTACAACCAACACAACCCGGGTGGTTCCGCATCACCGTGCTGCGTGCCGCAGGCTTTGG 


1496 


Qy 


1146 


AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 


1205 






MM lllllllllll ,11 lllllllll 




Db 


1497 


AGCCACTGCCCATCGTCTACTACGTGGGTCGCAAGCCCAAGGTGGAGCAGTTGTCCAACA 


1556 


Qy 


1206 


TGATCGTG CGTTCCTG CAAGTG CAG CTGA 1234 








Illlllllll llllllllllllllllil 




Db 


1557 


TGATCGTGCGCTCCTGCAAGTGCAGCTGA 158 5 





RESULT 7 

US-09-756-283A-19 

; Sequence 19, Application US/09756283A 
; Patent No. US20020151478A1 
; GENERAL INFORMATION: 

APPLICANT: Chernaj ovsky , Yuti 

APPLICANT: Dreja, Hanna Stina 
; APPLICANT: Adams, Gillian 
; TITLE OF INVENTION: Latent Fusion Protein 
; FILE REFERENCE: 0623.1000000 

; CURRENT APPLICATION NUMBER: US/0 9/756 , 28 3A 
; CURRENT FILING DATE: 2001-01-09 
; NUMBER OF SEQ ID NOS : 100 

SOFTWARE: Patentln version 3.0 
; SEQ ID NO 19 

LENGTH: 13 76 

TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: LAP-mlFNbeta construct 
NAME /KEY : CDS 
LOCATION: (1) . . (1368) 
US-09-756-283A-19 



Query Match 48.3%; Score 657.2; DB 10; Length 1376; 

Best Local Similarity 88.3%; Pred. No. le-168; 

Matches 726; Conservative 0; Mismatches 93; Indels 3; Gaps 1; 
Qy 11 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 70 

Ml MM II lllllllllll 1 1 1 1 llllllllllllll II Ml 

Db 1 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 60 

Qy 71 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 13 0 

II M 1 1 1 1 1 II M IMIIIMI Mill I f 1 1 1 1 1 1 1 f 1 1 1 1 I [ I ; 1 1 1 1 ; I i I j 1 J 

Db 61 CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 12 0 



Qy 131 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 190 

M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 MIMMIMI MIMIMIM Mill III 

Db 121 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 180 

Qy 191 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 2 50 



llllllllllllllllllll IIIIIMIIIII llllll I lllll II II II II 

Db 181 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 24 0 



Qy 


251 


Db 


241 


Qy 


311 


Db 


301 


Qy 


371 


Db 


361 


Qy 


431 


Db 


421 


Qy 


488 


Db 


481 


Qy 


548 


Db 


541 


Qy 


608 


Db 


601 


Qy 


668 


Db 


661 


Qy 


728 


Db 


721 


Qy 


788 


Db 


781 



TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 310 

llllllll llllll I Ml llllllll II I I MM III III III II II III 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 3 00 
GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 370 

II Mill Mill I II MM I III IIIMIMM I II 1 1 Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 36 0 

TATGATAAATTCAAGGGCACCCCCCAC^GCTTATATATGCTGTTCAACACGTCGGAGCTC 43 0 

lllll II llllll I I llllll llllllll I llllllll II llllll 
TATGACAAGTTCAAGCAGAGTACACACA^ 42 0 

CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCT- - -GAGG 487 

II llllllll II lllll II llllllll III INI II, Mill I 1 1 1 

CGAGAAG CGGTAC CTG AA CC CGTGTTGCT CT CC CGGG CAGAG CTG CGTCTG CTGAGGAGG 4 8 0 
CTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGG 547 

II I M 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 M M 1 1 II III 1 1 1 1 III I II I II I llllllll 

CTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGG 54 0 
CGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGAT 607 

II 1 1 II Ml II M I M II 1 1 1 II 1 1 Mill lllll M llllll I II llllll 

CGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGAT 600 
GTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTC 667 

Illlllllllllllllllllllll III III Ml II II lllll llllllll 

GTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTT 66 0 

AGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTC 727 

II llllllll lllllllllllll lllllllllll II lllll II lllllllll 
AGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTC 720 



I I I llllllll IMIMMIMIM Mill I I I M I I I I I I I I llllllll 



CTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGC 82 9 

II M 1 1 1 1 1 1 1 1 M 1 1 III II 1 1 1 II 1 1 1 II lllll III 

CTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGC 822 



RESULT 8 

US-09-756-283A-21 

; Sequence 21, Application US/09756283A 
; Patent No. US2 002015 1478A1 
; GENERAL INFORMATION: 

APPLICANT: Cherna j ovsky , Yuti 
; APPLICANT: Dre j a , Hanna St ma 
; APPLICANT: Adams, Gillian 
; TITLE OF INVENTION: Latent Fusion Protein 
; FILE REFERENCE: 0623.1000000 

; CURRENT APPLICATION NUMBER : US/09/756 , 283A 
; CURRENT FILING DATE: 2001-01-09 
; NUMBER OF SEQ ID NOS : 100 



SOFTWARE: Patentln version 3.0 
SEQ ID NO 21 
LENGTH: 13 52 
TYPE: DNA 

ORGANISM : Artificial Sequence 
FEATURE : 

OTHER INFORMATION: mIFNbeta-LAP construct 
NAME/ KEY: CDS 
LOCATION: (1) . . (1344) 
US-09-756-283A-21 

Query Match 43.7%; Score 594.6; DB 10; Length 1352; 

Best Local Similarity 87.3%; Pred. No. l.le-151; 

Matches 664; Conservative 0; Mismatches 94; Indels 3; Gaps 1; 

GACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTGGT 132 

I II II I I II I I I I III I MMIIIIIll : I II II I I I h I I I I I I 
GGCGGGAGGGGGCTCAGCGGCCGCACTATCCACCTGCAAGACTATCGACATGGAGCTGGT 641 

GAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAG 192 

1 1 1 1 i 1 1 1 1 1 ' 1 1 1 1 1 . 1 ! 1 1 1 1 MMIIIIIll MMIIIIIll Mill Mill 

GAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCCAG 7 01 
CCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTA 2 52 

llllll IMIMIMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill II II II II II 

CCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTGTA 761 
CAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGC 312 

llllll MMMMMIMI llllllll MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 

CAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGC 821 
GGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTA 372 

1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M M II I MMIIIIIll 

CGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTA 8 81 
TGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCG 432 

III II llllll I I llllll llllllll I I II II II III!: 1 1 1 

TGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTT 941 
GGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCT GAGGCT 4 8 9 

llllllll II Mill II llllllll llllllllllllll Mill llllll 

AGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGAGGCT 1001 



Qy 


73 


Db 


582 


Qy 


133 


Db 


642 


Qy 


193 


Db 


702 


Qy 


253 


Db 


762 


Qy 


313 


Db 


822 


Qy 


373 


Db 


882 


Qy 


433 


Db 


942 



Qy 


490 


CAA.GTTAAAAGTGGAGC^G(^CGTGGAGCTATAC(^GAAATA(^G(^TGATTCCTGGCG 

llllllll llllllll! IIIIMIIIIMI MM II [IIIIIIIMI IIIIIMIII 

CAAGTTAAAAGTGGAG CAG CA CGTGGAG CTGTACCAGAAATA CAG CAACAATT CCTGG CG 


549 


Db 


1002 


1061 


Qy 


550 


CTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGT 

1 1 1 M II M II 1 1 1 1 1 1 II II II Mill Mill 1 1 llllll 1 II llllllll 
ATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGT 


609 


Db 


1062 


1121 


Qy 


610 


CACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAG 

II IIIIMIIIIMI 1 llllll III III Ml II II Mill llllllll II 

CACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAG 


669 


Db 


1122 


1181 


Qy 


670 


TGCCCACTGTTCCTGTGACAGCAAAGATAAC^ 

llllllll 'II II ll 1 MMIIIIIll M Mill II IIIIIMIII 


729 



Db 



1182 CGCCCACTGCTCCTGTGACAGCAGGGATAA(^CACTGCyUVGTGGACATCAACGGGTTCAC 1241 



Qy 73 0 TTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCT 78 9 

I I IIIIIIII IIIIIIIIIMIIIIIIIII Mill II II! IIIIIIII II 

Db 124 2 TACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCT 13 01 

Qy 790 CATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCT 83 0 

II II IIMMMIII'IIIIIMIM! Mill MM 

Db 13 02 CATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCT 1342 



RESULT 9 

US-09-911-904-167 

Sequence 167, Application US/09911904 
Publication No. US20030096234A1 
GENERAL INFORMATION: 
APPLICANT: Farr, Spencer B. 
APPLICANT: Pickett, Gavin G. 
APPLICANT: Neft, Robin Eileen 
APPLICANT: Dunn, II, Robert Thomas 
TITLE OF INVENTION: CANINE TOXICITY GENES 
FILE REFERENCE: 400742000200 
CURRENT APPLICATION NUMBER : US/09/911,904 
CURRENT FILING DATE: 2002-04-09 
PRIOR APPLICATION NUMBER: US 60/220,057 
PRIOR FILING DATE: 2000-07-21 
NUMBER OF SEQ ID NOS : 386 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 167 
LENGTH: 4 89 
TYPE: DNA 

ORGANISM: Cams familiaris 
FEATURE : 

NAME/KEY : misc_f eature 
LOCATION: (1) . . . (489) 
OTHER INFORMATION: n = A, T, C or G 
US-09-911-904-167 

Query Match 26.0%; Score 354.4; DB 11; Length 489; 

Best Local Similarity 84.1%; Pred . No. 1.8e-86; 

Matches 445; Conservative 0; Mismatches 32; Indels 52; Gaps 2; 
Qy 774 GGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCC 833 

I lllllllllllll IIIIMI II 1 1 1 II ); 1 1 MM 1 1 1 1 1 . 1 1 1 1 1 1 II ' I II 

Db 1 GACCCTTCCTGCTCCTCATGGCCACCCCACTGGAGAGGGCCCAGCACCTGCACAGCTCCC 60 

Qy 834 GGCACCGCCGAGCCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTC 8 93 

MM Mill 

Db 61 GGCAGCGCCG 70 

Qy 8 94 TGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGC 953 

1 1 1 r f 1 1 1 1 iiiiiii iiiiMi 

Db 71 -GGCCCTGGACACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTCCGGCAGC 12 9 

Qy 954 TCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACC 1013 

II III IMM II Mill Illllllll Mill Mill IIIIIIII MM 

Db 13 0 TCTACATTGACTTCCGCAAGGATCTGGGCTGGAAGTGGATCCATGAGCCCAAGGGTTACC 18 9 



Qy 1014 ATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCA 1073 

I 1 1 II 1 1 III I III 1 1 II I III Ml I II II III I MM Mill III I llllll 

Db 190 ACGCTAACTTCTGCCTGGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCA 24 9 

Qy 1074 AGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGC 1133 

I II 1 1 1 1 II I II 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 II II 1 1 1 1 II 

Db 250 AGGTCCTGGCCCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGC 3 09 

Qy 1134 CGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGC 1193 

II I II I II I I I M I I I II II I II M I I II II I I I I I I I I I M I I I I I I I I I I I I I I I I I I 

Db 310 CGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGC 369 

Qy 1194 AGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCG - CCCACA 1252 

II II II I II III I Ml II II I III MM Mill Mill III MUM MM II II 

Db 370 AGCTGTCGAACATGATCGTGCGCTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGTCCGGCA 42 9 

Qy 12 53 GCCCCGCCCACCCGGCAGGCCCGGCCCCACCCCCGCCCGCCTCACCGGG 13 01 

I III II MM M MMMM 1 1 1 1 1 1 1 1 1 1 1 I Mill 

Db 4 3 0 GGCCCCGCCCACCGGCAGGNCCGGCCCCGCCCCCGCCCGCTGCGCCGGG 4 78 



RESULT 10 
US-09-813-271B-1 

; Sequence 1, Application US/09813271B 
; Patent No. US20020115834A1 
GENERAL INFORMATION: 
APPLICANT: 

(A) Nico Cerletti 
TITLE OF INVENTION: New process for the production of 
; biologically active protein 

NUMBER OF SEQUENCES: 13 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: No. US2 002 0 1 158 34Alart is Patent Department 

STREET: 564 Morris Avenue 

CITY: Summit 

STATE: New Jersey 

COUNTRY: USA 

ZIP: 07901 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/813 , 271B 

FILING DATE: 20-Mar-2001 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/EP95/ 027 1 9 

FILING DATE: 12-Jul-95 

APPLICATION NUMBER: EPO 94810439.3 

FILING DATE: 25- Jul -94 
ATTORNEY/AGENT INFORMATION: 

NAME: Pfeiffer, Hesna J. . 

REGISTRATION NUMBER: 22 640 

REFERENCE/DOCKET NUMBER: 4 -20039C/C1C1/USN 
TELECOMMUNICATION INFORMATION: 



TELEPHONE : (908) 522-6940 
TELEFAX: (908) 522-6955 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 33 9 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: CDNA to mRNA 
HYPOTHETICAL: NO 
IMMEDIATE SOURCE : 

CLONE: E . coli LC137/pPLMu . hTGF-betal (DSM 5656) 
FEATURE : 

NAME/ KEY: CDS 
LOCATION: 1 . .336 

OTHER INFORMATION: /product= "human TGF-betal" 
SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
US-09-813-271B-1 

Query Match 22.7%; Score 308.6; DB 10; Length 339; 

Best Local Similarity 94.4%; Pred. No. 4.6e-74; 

Matches 320; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 
Qy 8 96 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 955 

MINIM MINIM 1 1 1 1 M II M I II II II M 1 1 II 1 1 M I M M I II II II 1 1 

Db 1 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 6 0 

Qy 956 TA(>TTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 1015 

I ! F 1 1 ! 1 1 1 F 1 1 F 1 llllllll NIMMMINN II 1 1 INNINNINN 

Db 61 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 120 

Qy 1016 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 1075 

INN llllllll llllllll llllllll 1 1 1 F 1 1 1 1 INN 1 1 1 M 1 1 , 1 1 M 

Db 121 GCCAACTTCTGCCTCGGGCCCTGCCCCTAO\TTTGGAGCCTGGACACGCAGTACAGCAAG 18 0 

Qy 1076 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 1135 

iiiiiiii n : iii : 1 1 1 1 1 1 1 iiiiiiNiii ii inn i.iiiiii ii ii inn 

Db 181 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 24 0 

Qy 1136 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1195 

ii i ii 1 1 1 1 1 1 1 1 1 i ii 1 1 1 : mi i m ii 1 1 1 1 1 1 ii i, i ii i m 1 1 , 1 1 1 1 1 1 1 1 ii 

Db 241 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 3 00 

Qy 1196 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1234 

1 1 1 1 M 1 1 1 i Ml I M 1 1 1 1 1 1| 1 1 1 , 1 1 1 1 1 1 1 1 1 ! 

Db 3 01 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 339 



RESULT 11 
US-09-906-158-3 

; Sequence 3, Application US/09906158 
; Publication No. US200300782 17A1 
; GENERAL INFORMATION: 
; APPLICANT: Brett P. Monia 

APPLICANT: Susan M. Freier 
; TITLE OF INVENTION: ANTISENSE MODULATION OF TRANSFORMING GROWTH FACTOR-BETA 3 
EXPRESSION 



FILE REFERENCE: RTS-0257 

CURRENT APPLICATION NUMBER : US/09/906,158 
CURRENT FILING DATE: 2001-07-14 
NUMBER OF SEQ ID NOS : 168 
SEQ ID NO 3 
LENGTH: 2 574 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME/ KEY: CDS 
LOCATION: ( 2 54 ) . . . ( 14 92 ) 
US-09-906-158-3 

Query Match 18.3%; Score 249; DB 11; Length 2574 ; 

Best Local Similarity 53.5%; Pred. No. 1.2e-57 ; 

Matches 668; Conservative 0; Mismatches 530; Indels 51; Gaps 5; 

Qy 3 6 TG CCG CTG CTG CTG C CG CTG CTGTGG CTG CTAGTG CTG A CG CCTGG C CGG C CGG C CG CCG 95 

III Ml I I III II III 1,11 || | I || 

Db 261 TGCACTTGCAAAGGGCTCTGGTGGTCCTGGCCCTGCTGAACTTTGCCACGGTCAGCCTCT 32 0 

Qy 96 GACTGTCCACCTGCAAGACCATCGACATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCA 155 

llllll II MM III I III I I I MM Mill I II MM 

Db 321 CTCTGTCCACTTGC^CC^CCTTGGACTTCGGCCA(^T(^GAAGAAGAGGGTGGAAGC(^ 38 0 

Qy 156 TTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGC 215 

II I II Mill II MUM MM IIIMIMM I M II 

Db 381 TTAGGGGAC^GATCTTGAGCAAGCTC^GGCTC^CC^GCCCCCCTGAGCCAACGGTGATGA 44 0 

Qy 216 CGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAG 275 

I I II Mil M I II I I lllllllllll Mill II II II 

Db 441 CCCACG TCCCCTATCAGGTCCTGGCCCTTTACAACAGCACCCGGGAGCTGCTGG 4 94 

Qy 2 76 CCGGGGAAAGTGTCGAACCGGAGCCCG AGCCAGAGGCGGACTACT 32 0 

M MM MM I I I Ml MM 1 1 1 1 

Db 4 95 AGGAGATGCATGGGGAGAGGGAGGAAGGCTGCACCCAGGAAAACACCGAGTCGGAATACT 554 

Qy 321 ACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGATAAAT 380 

I Mill II II I MM I I I I Ml 

Db 555 ATGCCAAAGAAATCCATAAATTCGACATGATCCAGGGGCTGGCGGAGCACAACGAACTGG 614 

Qy 381 TCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGG 44 0 

Ml I I I I I III I II I I M 

Db 615 CTGTCTGCCCTAAAGGAATTACCTCCAAGGTTTTCCGCTTCAATGTGTCCTCAGTGGAGA 674 

Qy 441 TGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAG 500 

M III M I I Mill II III I 

Db 675 AAAATAGAACCAACCTATTCCGAGCAGAATTCCGGGTCTTGCGGGTGCCCAACCCCAGCT 734 

Qy 501 TGGAGCAGCACGTGGAG CTATACCAG AAATACAG CAATG 53 9 

Ml II I III II I I I II II 

Db 73 5 CTAAGCGGAATGAGCAGAGGATCGAGCTCTTCCAGATCCTTCGGCCAGATGAGCACATTG 794 



Qy 54 0 ATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGT 599 

MUM III II Ml Ml M II I 1 1 1 1 1 1 1 M I 

Db 7 95 CCAAACAGCGCTATATCGGTGGCAAGAATCTGCCCACACGGGGCACTGCCGAGTGGCTGT 854 



Qy 600 CCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTT 65 9 

lllllllllllll I llllll llllllll I MINI I I IN 
Db 855 CCTTTGATGTCACTGACACTGTGCGTGAGTGGCTGTTGAGAAGAGAGTCCAACTTAGGTC 914 

Qy 660 TTCGCCTCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACA CACTCCACGTGG 713 

I I I I I III MM III III I I II Ml I 

Db 915 TAGAAAT CAG CATTCA CTGT C CATGT CA CACCTTT CAG C CCAATGG AGATAT C CTGGAAA 974 

Qy 714 AAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCC^CCATTCACGGCATGAACC 773 

I Ml III I I I I I I III II III II 

Db 975 A CATTCA CGAGGTGATGGAAATCAAATTCAAAGGCGTGGACAATGAGGATGACCATGGCC 1034 

Qy 774 GGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCC 833 

I llllllll II I M Ml Ml M M 

Db 103 5 GTGGAGATCTGGGGCGCCT - - - CAAGAAGCAGAAGGATCACCACAACCCTCATCTAATCC 1091 

Qy 834 GGCACCGCCGAGCCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTC 8 93 

III I II I II I I I I I I I 

Db 1092 TCATGATGATTCCCCCACACCGGCTCGACAACCCGGGCCAGGGGGGTCAGAGGAAGAAGC 1151 

Qy 894 TGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGC 953 

III MM Mill IMMIIII II I MM IIIMIMM Mill I I 

Db 1152 GGGCTTTGGACACCAATTACTGCTTCCGCAACTTGGAGGAGAACTGCTGTGTGCGCCCCC 1211 

Qy 954 TCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACC 1013 

1 1 1 M II M 1 1 1 1 1 1 1 MM IIMIIIMMIIM I MMMM IMMIIII 

Db 1212 TCTACATTGACTTCCGACAGGATCTGGGCTGGAAGTGGGTCCATGAACCTAAGGGCTACT 1271 



Qy 1014 ATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCA 1073 

1 1 1 1 1 ! I MIMI M II II II III II I II llllll MUM 

Db 12 72 ATGCCAACTTCTGCTCAGGCCCTTGCCC^TACCTCCGC^GTGCAGACACAACCCACAG^ 1331 

Qy 1074 AGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGC 1133 

III MM MMMM I Mill I II II II MM MMIIIIII 

Db 1332 CGGTGCTGGGACTGTACAACACTCTGAACCCTGAAGCATCTGCCTCGCCTTGCTGCGTGC 13 91 

Qy 1134 CG CAGG CG CTGG AG C CACTG C C CATCGTGTACTA CGTGGG C CG CAAG C C CAAGGTGGAG C 1193 

I MM MMMM Ml Mill M 1 1 1 1 1 MM II Mill llllll 

Db 13 92 CCCAGGACCTGGAGCCCCTGACCATCCTGTACTATGTTGGGAGGACCCCCAAAGTGGAGC 14 51 

Qy 1194 AGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGC 1242 

MM 1 1 1 1 1 1 1 1 1 I Ml II II M II lllllll MM I 

Db 14 52 AG CT CT C CAA CATGGTGGTG AAGT CTTGTAAATGTAG CTGAGA CC C CA C 1500 



RESULT 12 
US-10-028-158-20 

; Sequence 20, Application US/10028158 

; Publication No. US20020110833A1 

; GENERAL INFORMATION: 

; APPLICANT: Caniggia, Isabella 

; APPLICANT: Post, Martin 

; APPLICANT: Lye, Stephen 

; TITLE OF INVENTION: METHODS TO DIAGNOSE A REQUIRED REGULATION OF 

TITLE OF INVENTION : TROPHOBLAST 
; FILE REFERENCE : 11757. 38USWO 
; CURRENT APPLICATION NUMBER: US/10/028 , 158 



CURRENT FILING DATE : 2001-12-20 
PRIOR APPLICATION NUMBER: US/09/38 0,662 
PRIOR FILING DATE: 1999-12-21 
PRIOR APPLICATION NUMBER: PCT/CA9 8 / 00 18 0 
PRIOR FILING DATE: 1998-03-05 
PRIOR APPLICATION NUMBER: US 60/039,919 
PRIOR FILING DATE: 1997-03-07 
NUMBER OF SEQ ID NOS : 24 
SOFTWARE: Patent In version 3.0 
SEQ ID NO 20 
LENGTH: 2 574 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME /KEY: CDS 
LOCATION: (254) . . (1492) 
US-10-028-158-20 

Query Match 18.3%; Score 249; DB 13; Length 2574; 

Best Local Similarity 53.5%; Pred. No. 1.2e-57; 

Matches 668; Conservative 0; Mismatches 530; Indels 51; Gaps 5; 

Qy 3 6 TGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCG 95 

III III I I III II III MINI III I II 

Db 261 TGCACTTGCAAAGGGCTCTGGTGGTCCTGGCCCTGCTGAACTTTGCCACGGTCAGCCTCT 320 

Qy 96 GACTGTCCACCTGCAAGACCATCGACATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCA 155 

1 1 II II II II 1 1 MINIMI I I III I MM Ml MM 

Db 321 CTCTGTC»CTTG(^CCACCTTGGACTTCGGCC^C^TCAAGAAGAAGAGGGTGGAAGCCA 380 

Qy 156 TTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGC 215 

II I II Mill II MUM MM llllllllll I II II 

Db 381 TTAGGGG ACAG AT CTTGAG CAAG CT CAGG CT CA CCAG CC C C CCTG AG C CAA CGGTGATGA 44 0 

Qy 216 CGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAG 275 

MM Mil II Mill 1 1 1 1 1 1 1 : 1 1 1 MM II II M 

Db 441 CCCACG TCCCCTATCAGGTCCTGGCCCTTTACAACAGCACCCGGGAGCTGCTGG 4 94 

Qy 276 CCGGGGAAAGTGTCGAACCGGAGCCCG AGCCAGAGGCGGACTACT 32 0 

M II II MM I I I Ml MM MM 

Db 4 95 AGGAGATGCATGGGGAGAGGGAGGAAGGCTGCACCCAGGAAAACACCGAGTCGGAATACT 554 

Qy 321 ACGCC^GGAGGTC^CCCGCGTGCTAATGGTGGAAAGCGGC^C(^UUVTCTATGATAAAT 3 80 

I II I I I II II I Ml I I I III II 

Db 555 ATGCCAAAGAAATCC^TAAATTCGACATGATCCAGGGGCTGGCGGAGCAa^CGAACTGG 614 

Qy 381 TCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGG 44 0 

III I I M MM I I I I I II 

Db 615 CTGTCTGCCCTAAAGGAATTACCTCCAAGGTTTTCCGCTTCAATGTGTCCTCAGTGGAGA 674 

Qy 44 1 TGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAG 500 

II III II I I I Ml I II III I 

Db 675 AAAATAGAACCAACCTATTCCGAGCAGAATTCCGGGTCTTGCGGGTGCCCAACCCCAGCT 734 

Qy 501 TGGAG CAG CA CGTGGAG CTATACCAGAAATACAGCAATG 53 9 

Ml I M Ml M I I I II II 

Db 73 5 CTAAGCGGAATGAGCAGAGGATCGAGCTCTTCCAGATCCTTCGGCCAGATGAGCACATTG 7 94 



Qy 54 0 ATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGT 599 

IIIIII III II III II I I I I I I IIIIIIIIII 
Db 795 CCAAACAGCGCTATATCGGTGGCAAGAATCTGCCCACACGGGGCACTGCCGAGTGGCTGT 854 

Qy 600 CCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTT 659 

MINIMUM I IIIIII MINIM I IIIIII I I III 

Db 855 GCTTTGATGTCACTGACACTGTGCGTGAGTGGCTGTTGAGAAGAGAGTCCAACTTAGGTC 914 
Qy 660 TTCGCCTCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACA CACTCCACGTGG 713 

I MM IIIIII I III Ml I I N III I 

Db 915 TAGAAAT CAG CATT CA CTGT C CATGT CA CAC CTTTCAG C C CAATGGAGATATC CTGG AAA 974 

Qy 714 AAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACC 773 

I III III I I I I I I III II III M 

Db 975 ACATTCACGAGGTGATGGAAATCAAATTCAAAGGCGTGGACAATGAGGATGACCATGGCC 1034 

Qy 7 74 GGCCCTTCGTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCC 833 

I III I I I II Ml II III III II M 

Db 1035 GTGGAGATCTGGGGCGCCT- - - CAAGAAGCAGAAGGATCACCACAACCCTCATCTAATCC 1091 

Qy 834 GGCACCGCCGAGCCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTC 8 93 

III I II INI I I I I I I 

Db 1092 TCATGATGATTCCCCCACACCGGCTCGACAACCCGGGCC^GGGGGGTC^GAGGAAGAAGC 1151 

Qy 894 TGGCCCTGGATACC^CTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGC 953 

III MM Mill IIIIIIIM II I MM I INI INN INN I I 

Db 1152 GGGCTTTGGACACCAATTACTGCTTCCGCAACTTGGAGGAGAACTGCTGTGTGCGCCCCC 1211 

Qy 954 TCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACC 1013 

IMIIIIIIIINIM Mil MINIMNMIM I lilllll) MINN 

Db 1212 TCTACATTGACTTCCGACAGGATCTGGGCTGGAAGTGGGTCCATGAACCTAAGGGCTACT 1271 

Qy 1014 ATGCCAATTTCTGCCTGGGGCCCTGTCCCTAC^TCTGGAGCCTAGAC^CTCAGTACAGCA 1073 

lilllll IIIIII N II II II III II I II IIIIII IIIIII 

Db 12 72 ATGCCAACTTCTGCTCAGGCCCTTGCCC^TACCTCCGCAGTGCAGACACAACCCACAG^ 1331 

Qy 1074 AGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGC 1133 

III WW 1 1 1 1 I INN I II II II INI MINIUM 

Db 1332 CGGTGCTGGGACTGTACAACACTCTGAACCCTGAAGCATCTGCCTCGCCTTGCTGCGTGC 13 91 

Qy 1134 CGGAGGCGCTGGAGCC^CTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGC 1193 

I INI IIIIINI III Mill lilllll II II I I II 1 1 1 lilllll 

Db 13 92 CCCAGGACCTGGAGCCCCTGACCATCCTGTACTATGTTGGGAGGACCCCCAAAGTGGAGC 14 51 

Qy 1194 AGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGC 1242 

INI IIIIINN I IN II II II II lilllll INI I 

Db 14 52 AGCTCTCCAACATGGTGGTGAAGTCTTGTAAATGTAGCTGAGACCCCAC 15 00 



RESULT 13 
US-09-957-458B-9 

Sequence 9, Application US/09957458B 
Publication No. US20030166271A1 
GENERAL INFORMATION : 
APPLICANT: Chen, Una 

TITLE OF INVENTION: Method for growing stem cells 



FILE REFERENCE: P66567US0 

CURRENT APPLICATION NUMBER : US/09/957 , 4 58B 
CURRENT FILING DATE: 2001-09-21 
PRIOR APPLICATION NUMBER: PCT/EPOO/08247 
PRIOR FILING DATE: 2000-08-24 
PRIOR APPLICATION NUMBER: EP 99116533 
PRIOR FILING DATE: 1999-08-24 
NUMBER OF SEQ ID NOS : 10 
SOFTWARE: Patent In version 3.2 
SEQ ID NO 9 
LENGTH: 43 82 
TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION : Vector for transforming supporting cell with a foreign to 
express 

OTHER INFORMATION: a gene product of interest 
US-09-957-458B-9 

Query Match 17.8%; Score 242.8; DB 12; Length 4382; 

Best Local Similarity 53.0%; Pred . No. 6.6e-56; 

Matches 663; Conservative 0; Mismatches 542; Indels 45; Gaps 5; 
Qy 2 6 CTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTGCTGACGCCTGGCCGG 85 

1 1 1 1 ii i iii in i i iii i iii mill i i 

Db 4 55 CTG CAG C C CATG CA CTTG CAAAGGG CT CTGGTAGT CCTGG CC CTGCTG AA CTTGG CCA CA 514 

Qy 86 CCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTGGTGAAGCGGAAGCGC 14 5 

I I MINIM MM II MM I I I I III MM I 

Db 515 ATCAGCCTCTCTCTGTCCACTTGCACCACGTTGGACTTCGGCCACATCAAGAAGAAGAGG 574 

Qy 14 6 ATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAG 2 05 

I M MINI I 1 1 MMI 1 1 MMM MM MMMMM I 

Db 5 75 GTGG AAG CCATTAGGGGA CAG AT CTTG AG CAAG CT CAGG CTCA C CAG C C CCC CTG AG C CA 634 

Qy 2 06 GGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGC 2 65 

M II I I II Mil II Mill 1 1 1 1 1 1 1 1 1 1 1 MMI 

Db 63 5 TCGGTGATGACCCACG TCCCCTATCAGGTCCTGGCACTTTACAACAGCACCCGG 688 

Qy 266 GACCGGGTAGCCG GGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 310 

II Mil MM II III MM 

Db 68 9 GAGTTGCTGGAAGAGATGCACGGGGAGAGGGAGGAAGGCTGCACTCAGGAGACCTCGGAG 74 8 

Qy 311 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 370 

I M MMI MMI III M I MM I I II 

Db 74 9 TCTGAGTACTATGCCAAAGAGATCCATAAATTCGACATGATCCAGGGACTGGCGGAGCAC 8 08 

Qy 371 TATGATAAATTCAAGGGCACCCCCCTVCAGCTTATATATGCTGTTCAACACGTCGGAGCTC 43 0 

MM I II M I III I I II Ml 

Db 8 09 AATGAACTGGCCGTCTGCCCCAAAGGAATTACCTCTAAGGTTTTTCGTTTCAATGTGTCC 868 

Qy 431 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 490 

III III II II I M I MM II I 

Db 8 6 9 TCAGTGGAGAAAAATGGAACCAATCTGTTCCGGGCAGAGTTCCGGGTCTTGCGGGTGCCC 92 8 
Qy 4 91 A AGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAA 52 9 

I M I II MM llll Ill 



Db 



92 9 AACCCCAGCTCCAAGCGCACAGAGCAGAGAATTGAGCTCTTCCAGATACTTCGACCGGAT 98 8 



Qy 53 0 TACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCG 58 9 

I II I Mill II II III II I II I II I 

Db 98 9 GAGCACATAGCCAAGC^GCGCTACATAGGTGGC^GAATCTGCCCACAAGGGGCACCGCT 104 8 

Qy 590 GAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCT 64 9 

II MINIM II 1 1 r 1 1 1 1 1 I MIMI MM I MINI I 

Db 104 9 GAATGGCTGTCTTTCGATGTCACTGACACTGTGCGCGAGTGGCTGTTGAGGAGAGAGTCC 1108 

Qy 65 0 ATAGAGGGTTTTCGCCTCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCAC 7 09 

I MM I MM 1 1 1 1 1 1 I III III I I II M 

Db 1109 AACTTGGGTCTGGAAATCAGCATCCACTGTCCATGTCACACCTTTCAGCCCAATGGAGAC 1168 

Qy 710 GTGGAAATTAACG - GGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCAT 768 

I Ml II I I I M I II 

Db 1169 ATACTGGAAAATGTTCATGAGGTGATGGAAATCAAATTCAAAGGAGTGGACAATGAAGAT 1228 

Qy 76 9 GAACCGGGCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGGCCAGCACCTGCACAG 82 8 

II I I II I II III I Ml I I II I 

Db 122 9 GACCATGGCCGTGGAGACCTGGGGCGTCTCAAGAAGCAAAAGGATCACCACAACCCACAC 128 8 

Qy 82 9 CTCCCGGCACCGCCGAGCCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGC 8 8 8 

II I M Ml I Ml III III I MM I I 

Db 12 8 9 CTGATCCTCATGATGATCCCCCCACACCGACTGGACAGCCCAGGCCAGGGCAGTCAGAGG 134 8 

Qy 88 9 A - -TCTCTGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTG 94 6 

I IMMMM IMM MMMIM M I MM MMMMM M 

Db 134 9 AAGAAGAGGGCCCTGGACACCAATTACTGCTTCCGCAACCTGGAGGAGAACTGCTGTGTA 14 08 

Qy 94 7 CGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAG 1006 

M I II II 1 1 1 1 1 1 1 111 MM II I'M, 1 Ml I II Mill III 

Db 14 0 9 CGCCCCCTTTATATTGACTTCCGGCAGGATCTAGGCTGGAAATGGGTCCACGAACCTAAG 1468 

Qy 1007 GGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAG 1066 

II Ml Mlil MUM II M II M III M I III MIMI 

Db 1469 GGTTACTATGCCAACTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGCGCAGACACAACC 1528 

Qy 1067 TACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGGGCCGTGC 112 6 

I MM III II I M MMM I Mill I Mill M MM Ml 

Db 152 9 CATAGCACGGTGCTTGGACTATACAACACCCTGAACCCAGAGGCGTCTGCGTGGCCATGC 1588 

Qy 112 7 TGCGTGCCGCAGGCGGTGGAGCCAGTGCCCATCGTGTACTAGGTGGGCCGCAAGCCCAAG 118 6 

IIIII II MM Mil Ml Mill Mill MIMI I I MIMI 

Db 158 9 TGCGTCCCCCAGGACCTGGAGCCCCTGACCATCTTGTACTATGTGGGCAGAACCCGCAAG 164 8 

Qy 1187 GTGGAGCAGCTGTCCAACATGATCGTGCGTTCGTGCAAGTGCAGCTGAGG 1236 

Ml Mill 1 1 MM MMM I I Ml II I! MIMI MMM! I 

Db 164 9 GTGGAGCAGCTGTCCAACATGGTGGTGAAGTCGTGTAAGTGCAGCTGAGG 1698 



RESULT 14 
US-09-906-158-10 

; Sequence 10, Application US/09906158 

; Publication No. US20030078217A1 

; GENERAL INFORMATION: 

; APPLICANT: Brett P. Monia 



APPLICANT : Susan M. Freier 
; TITLE OF INVENTION : ANTISENSE MODULATION OF TRANSFORMING GROWTH FACTOR-BETA 3 
EXPRESSION 

FILE REFERENCE; RTS-0257 

CURRENT APPLICATION NUMBER : US/09/906,158 
CURRENT FILING DATE: 2001-07-14 
NUMBER OF SEQ ID NOS : 168 
SEQ ID NO 10 
LENGTH: 28 79 
TYPE: DNA 

ORGANISM: Mus musculus 
FEATURE : 
NAME /KEY: CDS 
LOCATION: (611) . . . (1843) 
US-09-906-158-10 

Query Match 17.6%; Score 239.6; DB 11; Length 2879; 

Best Local Similarity 53.0%; Pred. No. 4.4e-55; 

Matches 658; Conservative 0; Mismatches 539; Indels 45; Gaps 5; 
Qy 3 6 TGCCGCTGCTG CTG C CG CTG CTGTGG CTG CTAG TG CTGA CG CCTGGCCGGCCGGCCGCCG 95 

Ml Ml MIMI III IIMII II II 

Db 612 TGCACTTGCAAAGGGCTCTGGTAGTCCTGGCCCTGCTGAACTTGGCCACAATCAGCCTCT 671 

Qy 96 GACTGTCCACCTGCAAGACCATCGACATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCA 155 

MINIM MM MINIMI I Ml MM I I II MM 

Db 672 CTCTGTCCACTTGCACCACGTTGGACTTCGGCCACATCAAGAAGAAGAGGGTGGAAGCCA 731 

Qy 156 TTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGC 215 

II I II lllll II MINI INI IIIIIIIIII I II II 

Db 732 TTAGGGGAC^GATCTTGAGC^AGCTCAGGCTCACCAGCCCCCCTGAGCCATCGGTGATGA 7 91 

Qy 216 CGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAG 2 75 

MM I II I II INN IMMIMIM lllll II Ml 

Db 792 CCCACG TCCCCTATCAGGTCCTGGCACTTTACAACAGCACCCGGGAGTTGCTGG 84 5 

Qy 276 CCG GGGAAAGTGTCGAAC CGGAG C C CG AG C CAGAGG CGGA CTACT 320 

I INI I I I II I Ml I II Ml 

Db 84 6 AAG AGATG CACGGGGAGAGGGAGG AAGG CTG CACT CAGGAGAC CT CGGAG T CTGAGTACT 9 05 

Qy 321 ACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGG(^CC^AATCTATGATAAAT 380 

I INN III II I III I I I II INI 

Db 906 ATGCCAAAGAGATCCATAAATTCGACATGATCCAGGGACTGGCGGAGCACAATGAACTGG 965 

Qy 381 TCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGG 44 0 

I Nil I MM III II I I II 

Db 966 CCGTCTGCCCCAAAGGAATTACCTCTAAGGTTTTTCGTTTCAATGTGTCCTCAGTGGAGA 102 5 

Qy 441 TGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCA 4 91 

III II II I II INI I II II 

Db 102 6 AAAATGGAACCAATCTGTTCCGGGCAGAGTTCCGGGTCTTGCGGGTGCCCAACCCCAGCT 108 5 

Qy 4 92 AGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATG 53 9 

II Ml INI I I II I I I I II I 

Db 1086 CCAAGCGCACAGAGCAGAGAATTGAGCTCTTCCAGATACTTCGACCGGATGAGCACATAG 114 5 



Qy 54 0 ATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGT 599 



Illllll II II III II I II I I I I II Illllll 

Db 114 6 CG^GCAGCGCTACATAGGTC 12 05 

Qy 600 CCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTT 65 9 

I II llllllll I llllll llllllll I llllll I I I I I I 

Db 12 06 CTTTCGATGTCACTGACACTGTGCGCGAGTGGCTGTTGAGGAGAGAGTCCAACTTGGGTC 1265 

Qy 660 TTCGCCTCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTA 719 

I I I I I Illllll I III III I I II III I 

Db 12 66 TGGAAATCAGCATCCACTGTCCATGTCACACCTTTCAGCCCAATGGAGACATACTGGAAA 132 5 

Qy 72 0 ACG - GGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCC 778 

llllll I II I I I I I I I II 

Db 13 2 6 ATGTTCATGAGGTGATGGAAATCAAATTCAAAGGAGTGGACAATGAAGATGACCATGGCC 138 5 

Qy 779 TTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGGCCAGCACCTGCACAGCTCCCGGCAC 838 

III II I I III I Mil II 

Db 13 86 GTGGAGACCTGGGGCGTCTCAAGAAGCAAAAGGATCACCACAACCCACACCTGATCCTCA 144 5 

Qy 83 9 CGCCGAGCCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCT- - CTGG 8 96 

I II Ml I III III I I I I I III I I I II 

Db 144 6 TGATGATCCCCCCACACCGACTGGACAGCCCAGGCCAGGGCAGTCAGAGGAAGAAGAGGG 15 05 

Qy 8 97 CCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCT 956 

Illllll Mill IIMIIIII II I MM llllilllll II II I II I 

Db 1506 CCCTGGACACCAATTACTGCTTCCGCAACCTGGAGGAGAACTGCTGTGTACGCCCCCTTT 1565 

Qy 957 ACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATG 1016 

I 1 1 1 1 ' 1 1 1 1 1 1 1 MM II llllllll III I II Mill Mill III III 

Db 1566 ATATTGACTTCCGGCAGGATCTAGGCTGGAAATGGGTCCACGAACCTAAGGGTTACTATG 162 5 

Qy 1017 CCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGG 1076 

MM MIMI II II II II III II I III MUM I MM II 

Db 162 6 CC^CTTCTGCTC^GGCCCTTGCCC^TACCTCCGC^GCGC^GACAC^ACCCATAGCACGG 168 5 

Qy 1077 TCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGC 1136 

I II I M llllll I Mill I Mill II MM MMMM II I 

Db 1686 TGCTTGGACTATACAACACCCTGAACCCAGAGGCGTCTGCCTCGCCATGCTGCGTCCCCC 174 5 

Qy 1137 AGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGC 1196 

Ml MMMM Ml Mill Illllll llllll I I 1 1 1 1 1 1 1 1 1 r f 1 1 1 1 1 

Db 174 6 AGGACCTGGAGCCCCTGACCATGTTGTACTATGTGGGCAGAACCCCCAAGGTGGAGCAGC 18 05 

Qy 1197 TGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCC 1238 

II Illllll I III II II llllllllllllllll 

Db 18 06 TGTCCAACATGGTGGTGAAGTCGTGTAAGTGCAGCTGAGGCC 184 7 



RESULT 15 
US-09-813-271B-7 

; Sequence 7 , Application US/09813271B 
; Patent No. US20020115834A1 
GENERAL INFORMATION: 
APPLICANT: 

(A) Nico Cerletti 
TITLE OF INVENTION: New process for the production of 

biologically active protein 



NUMBER OF SEQUENCES: 13 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: No. US200201 15834Alart is Patent Department 

STREET: 564 Morris Avenue 

CITY: Summit 

STATE: New Jersey 

COUNTRY: USA 

ZIP: 07901 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/8 13 , 271B 

FILING DATE: 20-Mar-2001 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/EP95/02719 

FILING DATE: 12 -Jul -95 

APPLICATION NUMBER: EPO 94810439.3 

FILING DATE: 25 -Jul -94 
ATTORNEY/AGENT INFORMATION: 

NAME: Pfeiffer, Hesna J. . 

REGISTRATION NUMBER: 2264 0 

REFERENCE/DOCKET NUMBER: 4 -2 003 9C/C1C1/USN 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (908) 522-6940 

TELEFAX: (908) 522-6955 
INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 33 6 base pairs 

TYPE: nucleic acid 

STRANDEDNESS : double 

TOPOLOGY: linear 
MOLECULE TYPE: other nucleic acid 

DESCRIPTION: /desc = "recombinant hybrid DNA of 
IMMEDIATE SOURCE: 

CLONE: E. coli LC137/pPLMu . TGF-beta 1 ( 44 /4 5 ) beta3 
FEATURE : 

NAME/KEY: mat_peptide 

LOCATION: 1..132 

OTHER INFORMATION: /product= "N- terminal 44 amino 
acids of human TGF-betal" 
FEATURE : 

; NAME/KEY: mat_peptide 

LOCATION: 133 . .336 

OTHER INFORMATION: /product= "C- terminal 68 amino 
; acids of human TGF-beta3" 

FEATURE : 

NAME /KEY: CDS 
LOCATION: 1..336 

OTHER INFORMATION: /product^ "hybrid TGF-beta named 
TGF-beta 1-3" 
SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
US-09-813-271B-7 



Query Match 



16.7%; Score 227.2; DB 10; Length 336; 



Best Local Similarity 79.8%; Pred. No. 6.2e-52; 

Matches 268; Conservative 0; Mismatches 68; Indels 0; Gaps 0; 



Qy 8 96 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 95 5 

MINIM MMMM M M M M M M M I M M M M M M M M M M M M M 

Db 1 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 60 

Qy 956 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 1015 

MMMMMMM MMMM MMMMMMM M M MMMMMMMI 

Db 61 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 12 0 

Qy 1016 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 107 5 

MMI M I M I M M M M Ml M I 1 1 MMM MMM I 

Db 121 GCC^ACTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGTGCAGACACAACCCACAGCACG 180 

Qy 1076 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 113 5 

M MM IMMMM I MMI I II II II MM MMMM 

Db 181 GTGCTGGGACTGTACAACACTCTGAACCCTGAAGCATCTGCCTCGCCTTGCTGCGTGCCC 24 0 

Qy 113 6 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1195 

MM MMMM Ml MMI MMM MM II MMI IMMMM 

Db 241 CAGGACCTGGAGCCCCTGACCATCCTGTACTATGTTGGGAGGACCCCCAAAGTGGAGCAG 3 00 

Qy 1196 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGC 1231 

II IMMMM I Ml M II II II III 

Db 3 01 CTCTCCAACATGGTGGTGAAGTCTTGTAAATGTAGC 33 6 

Search completed: October 28, 2003, 09:04:41 
Job time : 411.672 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2 003 Compugen Ltd. 

OM nucleic - nucleic search, using sw model 

Run on: October 27, 2003, 10:47:27 ; Search time 3377.39 Seconds 

(without alignments) 
9794.056 Million cell updates/sec 

Title: US-10-017-372E-3 8 

Perfect score: 1361 

Sequence: 1 tggtaccgagatggcgcctt cgattaaagcggccgcgact 1361 

Scoring table: I DENT I TY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 22781392 seqs, 12152238056 residues 

Total number of hits satisfying chosen parameters: 45562784 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 



Listing first 45 summaries 



Database : EST:* 

1 : em__estba : * 

2: em_esthum:* 

3: em_estin:* 

4: em_estmu:* 

5: em_estov:* 

6: em_estpl:* 

7: em__estro:* 

8: em_htc:* 

9 : gb_estl : * 

10: gb_est2:* 

11: gb_htc:* 

12: gb_est3:* 

13: gb_est4:* 

14: gb_est5:* 
15: em_estfun:* 
16: em_estom:* 
17: em_gss_hum:* 
18 : em_gss_inv: * 
19: em_gss_pln:* 
20: em_gss_vrt:* 
21: en\_gss_fun: * 
2 2 : em_gs s_mam : * 
2 3 : em_gss_mus : * 
24: em_gss_pro:* 

2 5 : em_gs s_rod : * 
2 6 : em_gs s_phg : * 
27: em_gss_vrl:* 

28: gb_gssl:* 

29: gb_gss2:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed; 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
BM562135 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



BM562135 1072 bp mRNA linear EST 20-FEB-2002 

AGENCOURT_6562032 NIH_MGC_118 Homo sapiens cDNA clone IMAGE : 5745463 
5', mRNA sequence. 
BM562135 

BM562135 . 1 GI : 18807966 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 1072) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpubl ished 

Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Life Technologies, Inc. 
cDNA Library Preparation: Life Technologies, Inc. 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



FEATURES 

source 



BASE COUNT 
ORIGIN 



Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
http : //image . 1 lnl . gov 
Plate; LLAM12768 row: h column: 08 
High quality sequence start: 9 
High quality sequence stop: 647. 

Locat ion/Qua 1 i f lers 

1. .1072 

/organism= " Homo sap iens " 

/mo 1 _t ype = " mRNA 11 

/db_xref ="taxon: 9606" 

/clone=" IMAGE: 5745463" 

/ 1 i ssue_type= " 1 eukocyt e " 

/lab_host= M DH10B" 

/ c 1 one_l ib= "NI H_MGC_1 1 8 " 

/note=" Vector: pCMV-SPORT6; Site_l: NotI; Site_2: EcoRV 
(destroyed) ; RNA source leukocytes from anonymous pool of 
non-activated adult donors. Library is oligo-dT primed 
and directionally cloned (EcoRV site is destroyed upon 
cloning). Average insert size 1.7 kb, insert size range 
1.2-3.3 kb. Library is normalized and enriched for 
full-length clones and was constructed by C. Gruber 
(Invitrogen) . Research Genetics tracking code 027. Note: 
this is a N I H_MGC Library." 
217 a 356 c 313 g 186 t 



Query Match 51.9%; Score 706.8; DB 12; 

Best Local Similarity 81.8%; Pred. No. 3.6e-144; 
Matches 888; Conservative 0; Mismatches 142; Indels 



Length 1072; 

56; Gaps 



4; 



Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 



127 GCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCT 18 6 

1 1 II 1 1 1 1 1 1 1 i 1 1 1 1'l 1 1 1 1 lllllll MINIUM MMMMMI MMI 

2 0 GCTGGTGAAGCGGAAGCGCATCTAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCT 79 
187 TGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGC 24 6 

MMMMMMMMMMMI I M M M 1 1 1 1 M M M M I MMI 1 1 M M 

8 0 CG C CAG CC C C C CGAG C CAGGGGG AGGTG C CG C C CGG C C CG CTG C C CGAGG C CGTG CTCG C 13 9 
24 7 TCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCC 3 06 

M MMMM MMMMMMM IMMMI MM MMMMMMMMI 

14 0 CCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCC 199 
3 07 AGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCA 366 

MMI IIMIIMMMMMMIMMIMMMMMIIMIMM I 1 1 1 1 I 

2 00 TGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGA 25 9 

3 67 AATCTATGATAAATT(^GGG(^CCCCC(^C^GCTTATATATGCTGTTCAACACGTCGGA 426 

IMMMM II MUM I I llllll MMMM I IMMMI II II 

260 AATCTATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTC 319 

4 27 GCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAG 4 8 6 

llllll IMMMI II MMI II MMMM MMMMMMM IMMMI 

32 0 GCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAG 379 

4 87 GCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTG 54 6 
I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I lllllll 



Db 



380 



GCTCAAGTTAAAAGTGGAGC^GC7VCGTGGAGCTGTACCAGAAATAa\G(^ACAATTCCTG 4 3 9 



Qy 


547 


Db 


440 


Qy 


607 


Db 


500 


Qy 


667 


Db 


560 


Qy 


727 


Db 


620 


Qy 


787 


Db 


680 


Qy 


847 


Db 


740 


Qy 


907 


Db 


750 


Qy 


967 


Db 


810 


Qy 


1027 


Db 


870 


Qy 


1087 


Db 


930 


Qy 


1145 


Db 


990 


Qy 


1201 


Db 


1050 



TGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCT 666 

1 1 1 1 1 1 1 1! 1 1 1 1 II III ; 1 1 1 1 1 III Ml III II II Mill IMI'll 

TGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCT 559 

CAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTT 72 6 

ii 1 1 1 1 1 1 1 1 i ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii mil ii r 1 1 1 1 1 1 1 

TAGCGCCCAGTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTT 619 



MMMM MMMMMMMMMM MMMMIMMI MIMMI 



CCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCGCGGCACGGCCGAGC 84 6 

1 1 1 ! 1 1 1 1 1 1 1 1 Mill 1 1 1 1 1 1 1 1 1 J 1 1 1 f 1 1 1 1 J I 

TCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGC 73 9 



847 CCTGGATACCAACAGCTAGCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATAC 906 

Illllll II 

74 0 CCCTGGACAC 74 9 



CAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGGGGCAGCTCTACATTGACTT 

M M II I M M 1 1 1 1 1 1 1 MMMMMIMIMMIIIMIMM MMMMMI 

CAACTATTGCTTCAGCTCCCCGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTT 



966 



809 



CCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTG 102 6 

Ml I M I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II III I MIMMI III I Mill 

CCGCAGGGACCTCGGCTGGAAGTGGATCCACGAGCCCCAAGGCTACCATGCCCACTTCTG 8 6 9 
CCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCT 1086 

III MMMM MIMMI MIMMI Mill I Illllll 1 1 1 1 1 1 II 

CCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCGTACCAGCAAGTCCCTGGCCCT 92 9 
GTACAACCAGCACAACCCGGGCGCGTGGGCGGCGCCGTGCTGC- -GTGCGGCAGGCGCTG 114 4 

Mill M M MM II I IIMMIMI I III I M II Ml 

GTACACCCGCCATAACCGGGCGCCCTCGGCGGCGCGGGTTTGCTGGGCCCCAGGCCCCTG 98 9 
GAGCCACTGCCCAT - - CGTGTACTACGTGGGCCGC - -AAGCCCAAGGTGGAGCAGCTGTC 12 00 

II II Mill I I Illllll MM III I 11,1 Ml M Mill 

GACCCCCTGCGCCTTCGGGGTACTACCTGGGGCGCCAAGCCCCAAGTGGGAACACCTGTC 104 9 



I MM 



RESULT 2 

BX355682/C 

LOCUS 

DEFINITION 



BX355682 1201 bp mRNA linear EST 05-MAY-2003 

BX355682 Homo sapiens PLACENTA COT 25 -NORMALIZED Homo sapiens cDNA 
clone CS0DI002YJ05 3-PRIME, mRNA sequence. 



ACCESSION BX3 55682 



VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



BX355682 . 1 GI : 3 0371987 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1201) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 916G.r For 
more information about this cluster, see 
ht tp : / /www . genoscope . ens . f r/ 

cgi -bin/cluster . cgi?seq=CSODI 0 02CE03NPl&cluster=9160 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0DI002CE03NP1 . 

Locat ion/Qua 1 l f iers 

1. .1201 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref ="taxon: 9606" 
/clone="CS0DI002YJ05" 

/tissue_type=" PLACENTA COT 2 5 -NORMALIZED" 
/clone_lib="Homo sapiens PLACENTA COT 25 -NORMALIZED" 
/note="lst strand cDNA was primed with a Not I -oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
198 a 326 c 386 g 253 t 38 others 



Query Match 51.6%; 
Best Local Similarity 82.1%; 
Matches 904 ; Conservative 



Score 701.6; DB 13; 
Pred. No. 5.1e-143; 
8; Mismatches 124; 



Length 1201; 
Indels 65; Gaps 



Qy 



Db 



7; 



260 ACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG-GCGGACTA 318 

I MINIMUM Mill I I ! 1 1 1! M 1 1 1 1 1 1 1 1 1 Ml MMMI 

104 6 AACCGCGACCGGGTGGCCGGG - - AGAKKCAGAACCGGAGCCCGAGCCTGAGCCCGRACTA 98 9 



Qy 

Db 



319 



988 



CTACGCC^GGAGGTC^CCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGATAA 

M 1 1 1 1 1 1 1 1 1 1 1 M = 1 1 1 1 1 1 1 1 II 1 1 1 1 1 r 1 1 1 1 1 1 I MM MMMMM 1 1 

CTACGCCAAGGAGGTMACCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTATGACAA 



378 



929 



Qy 

Db 

Qy 

Db 



37 9 ATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGC 438 

MMM I I MMM MMMM I MMMM 1 1 MMMM MMI 

92 8 GTTC^GCAGAGTA(^CACAGCATATATATGTTCTTC^CACATCAGAGCTCCGAGAAGC 8 69 



439 



497 



GGTGCCGGAACCTGTATTGCTCT - CTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAA 

Ml II Mill II MMMI I 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 i 1 1 1 ! 1 1 1 

868 GGTACCTGAACCCGTGTTGCTCTCCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAA 8 09 



Qy 4 98 AAGTGGAGCAGCACGTGGAGCTATACC^GAAATA^GQVATGATTCCTGGCGCTACCTCA 557 

I MM MM lllllllll IIIIIIIIMIIIII I MMMMM MMMI 

Db 8 08 AAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGATAGGTCA 74 9 

Qy 5 58 GCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTGCTTTGATGTCACCGGAG 617 

MMMMMMMM MMI MMI 1 1 MMM I 1 1 MMMMMMIMI 

Db 74 8 GCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAG 68 9 

Qy 618 TTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACT 677 

MMMMMMM Ml Ml III II II M II llllllll II MMMI 

Db 688 TTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGA-GGCTTTCGCCTTAGCGCCCACT 63 0 

Qy 678 GTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCC 73 7 

I IMIIIIIIIIII lllllllllll II Mill II Mllllllll I I MM 

Db 62 9 GCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCC 57 0 

Qy 738 GCCGGGGTGACCTGGCCACCATTCACGGCATGAACGGGCCCTTCCTGCTCCTCATGGCCA 7 97 

MM llllllllllllllllllll MMMMMMM llllllll MMMMM 

Db 569 GCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCA 510 

Qy 7 98 CCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACGGCCGAGCCCTGGATACCA 857 

MMIMIMMMIIIIIMI Mill IIIMIIIIIIIIMIIIIIIIIIII 

Db 5 09 CCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCCCTGGAC 4 54 

Qy 858 ACAGCTACCCATACGACGTGGCAGACTACGCATCTCTGGCCCTGGATACCAACTACTGCT 917 

llllllll MM 

Db 4 53 ACCAACTATTGCT 441 

Qy 918 TCAGCTCCAGGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 977 

I M 1 1 1 1 1 1 1 II 1 1 1 1 1 IIIIIIMIIIIIMIIII MMMMMMM MMMI 

Db 44 0 TCAGCTCCACGGAGAAGNACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGAGC 381 

Qy 978 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTAGCATGCCAATTTCTGCCTGGGGGCCT 1037 

I MMMMMMM II II 1 1 1 1 1 II I M 1 1 1 1 1 1 1 1 II llllllll MMMI 

Db 380 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTAGCATGCCAACTTCTGCCTCGGGCCCT 321 

Qy 1038 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 1097 

I llllllll llllllll Mill III IMIIIIIIIIII 

Db 32 0 GGGCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAAGCAGC 2 61 

Qy 1098 ACAACCCGGGCGCGTCGGCGGCGCGGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 1157 

I lllllllllll ! i ! 1 1 M 1 1 . 1 1 1 1 1 II 1 . 1 1 , 1 1 1 ! I ! 1 1 1 1 MMMI 

Db 2 60 ATAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGGTGGAGGCGCTGGCCA 201 

Qy 1158 TCGTGTACTACGTGGGCCGCAAGGGCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 1217 

I I ' . I I I | 

Db 2 00 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 141 

Qy 1218 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGGCCGGC 1277 

I I M 1 1 1 1 II 1 1 1 M 1 1 1 1 MMMMM M I II I I I MMI I 

Db 14 0 CCTGCAAGTGCAGCTGAGGTGCCGCCCCGCCCCGCCCCGCCCCRGCAGGCCCGGCCGCAC 81 
Qy 12 78 GCCACCCCCGCCGGCCT CACCGGGGCTGTATTTAAGGACATCGTGCGCCAAG 132 9 

Ml MM III II II MMMMMMM MM I lllllll 

Db 8 0 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAA-GACACCCGTCCCCAAG 22 

Qy 133 0 CCCACTTGGGATCGATTAAAG 13 5 0 



Mill -Ihllh 

Db 21 CCCACNNAMNNMMDATHAAAS 



RESULT 3 
BX349319 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



Craniata; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



BASE COUNT 
ORIGIN 



BX349319 900 bp mRNA linear EST 05-MAY-2003 

BX349319 Homo sapiens B CELLS (RAMOS CELL LINE) COT 2 5 -NORMALIZED 
Homo sapiens cDNA clone CS0DL010YL07 5 -PRIME, mRNA sequence. 
BX349319 

BX349319. 1 GI :30379410 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 900) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 
http: //www. genoscope. ens . fr/ 

cgi -bin/cluster . cgi?seq=CS0BAG05 9ZDO4_CS055 96_l&cluster=9160 . r . 
Contact : Feng Liang Email : fliang@lifetech.com URL : 
http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CSOBAG059ZD04_CS05596_1 . 

Loc a t i on / Qua 1 i f i e r s 

1. .900 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref ="taxon: 9606" 
/clone= M CS0DL010YL07" 

/cell_type="B CELLS (RAMOS CELL LINE) COT 2 5 -NORMALIZED" 
/cell_line= "RAMOS CELL LINE" 

/clone_lib="Homo sapiens B CELLS (RAMOS CELL LINE) COT 
2 5 -NORMALIZED" 

/note="lst strand cDNA was primed with a Not I -ol igo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
179 a 292 c 270 g 155 t 4 others 



Query Match 50.8%; 
Best Local Similarity 88.1%; 
Matches 752; Conservative 



Score 692; DB 13; Length 900; 
Pred. No. 5.8e-141; 
0; Mismatches 102; Indels 0; 



Gaps 



Qy 

Db 



30 



CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 

II Ml 1 1 1 1 II lllllllllll ''lull, II 1 1 1 1 1 1 1 1 1 1 1 1 , I 

CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 



65 



89 



Qy 66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

I Illlllllllllllllllllllllll Mill llllllllllllll MINIMI 

Db 90 TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 14 9 

Qy 126 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 

I M 1 1 1 1 1 1 1 1 1 1 1 1 1 II II I II 1 1 1 1 1 1 1 1 lllllllllll lllllllllll MM 

Db 15 0 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 2 09 

Qy 186 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I MMMMMMIMMMMM 1 1 1 1 1 1 M 1 1 1 1 ! M 1 1 M I MMI II II I 

Db 210 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 269 

Qy 24 6 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II llllllll 1 1 1 1 1 1 1 1 1 II 1 1 1 llllllll MM II I M II M M II M I 

Db 270 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 32 9 

Qy 3 06 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 3 65 

I Mill i 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 J J 1 1 1 1 1 j 1 1 E 1 1 1 1 ! I ! I MM 

Db 33 0 CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 38 9 

Qy 366 AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 5 

MMMMM M M M II I I I II M I MMMM I I II M M I II I 

Db 3 90 AAATCTATGACAAGTTC^GCAGAGTACACACAGC^TATATATGTTCTTC^CACATCAG 44 9 

Qy 426 AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 85 

lllllll MMMM II Mill II llllllll ,1 II MMI! IMIMI 

Db 4 50 AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 50 9 

Qy 4 86 GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

II llllil M MIMIIIII lllllllh Ml II MIMIM Ml III III MUM 

Db 510 GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 569 

Qy 54 6 GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 605 

MM MMMMMMMMMMMI MMI MMI M MMM I M MM 

Db 570 GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 62 9 

Qy 6 06 ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

II I IM 1 1 1 1 Ml 1 1 MM 1 1 ' II 1 1 Ml III Ml II II Mill IMIMI 

Db 63 0 ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 68 9 

Qy 666 TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 725 

I II MMMM IIIIMIMIIII lllllllllll M Mill M lllllll 

Db 6 90 TTAGCGCCCACTGCTCCTGTGACAGC^GGGATAACACACTGCAAGTGGACATCAACGGGT 74 9 

Qy 72 6 TCAATTCTGGCCGCGGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 785 

Ml I I MMMM M I M II M M II II M II I M I II II M II M I II MM 

Db 750 TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTTCTGC 8 09 

Qy 78 6 TCCTCATGGCCACCGCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I M I M I M 1 1 1 1 M I II 1 1 M II 1 1 1 1 1 1 1 1 1 Mill I f 1 1 1 1 1 1 1 1 1 1 f I MM 

Db 810 TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGNCGAG 86 9 

Qy 84 6 CCCTGGATACCAAC 8 59 

llllll I II 

Db 870 CCCTGGNACACCAC 8 83 



RESULT 4 

BX335351/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



Craniata; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 

and Polayes,D. 



BASE COUNT 
ORIGIN 



BX335351 983 bp mRNA linear EST 01-MAY-2003 

BX335351 Homo sapiens PLACENTA COT 2 5 - NORMALI ZED Homo sapiens cDNA 
clone CS0DI013YH16 3-PRIME, mRNA sequence. 
BX335351 

BX335351.1 GI:30308367 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 983) 
Li,W.B., Gruber,C, Jessee,J. 
Full-length cDNA libraries and normalization 
Unpublished 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 

http : //www. genoscope . ens . f r/cgi -bin/ cluster . cgi?seq=CSODI 013DD08NP1 
&cluster=9160 . r . Contact : Feng Liang Email : fliang@lifetech.com 
URL : http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CSODI 013DD08NP1 . 

Location/Qualifiers 

1. .983 

/organism^ "Homo sapiens 1 ' 
/mol_type= M mRNA" 
/db_xref ="taxon: 9606" 
/clone= " CSODI 0 13 YH1 6 " 

/tissue_type=" PLACENTA COT 2 5 -NORMALIZED" 
/clone_lib="Homo sapiens PLACENTA COT 2 5 -NORMALI ZED" 
/note="lst strand cDNA was primed with a NotI -oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
170 a 280 c 323 g 203 t 7 others 



Query Match 50.7%; 
Best Local Similarity 83.6%; 
Matches 859; Conservative 



Score 690.6; DB 13; 
Pred. No. l.2e-140; 
5; Mismatches 105; 



Length 983; 
Indels 59; 



Gaps 



5; 



Qy 



Db 



278 GGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCACC 337 

Mill MM Mill MM MM III I MM! MM!!!!!!!!!!!!!!!!!!!! 

983 GGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGCCGACTACTACGCCAAGGAGGTCACC 924 



Qy 



Db 



33 8 CGCGTGCTAATGGTGGAAAGCGGC^CO^AATCTATGATAAATTCAAGGGCACCCCCCAC 397 

MM I MM llllllllll II MIMI I I Ml 

923 CGCGTGCTAATGGTGGARACCCACAACGAAATCTATGACAAGTTCAAGCAGAGTACACAC 8 64 



Qy 3 98 AGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTG 4 57 

III MINIM I MINIM II MINIM MINIM II INN II III 

Db 863 AGCATATATATGTTCTTCAACACATCAGAGCTCCGAGAAGCGGTACCTGAACCCGTGTTG 804 



Qy 

Db 


458 
803 


CTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAG 

Mill llllllllllllll IMIIIIIIMIIIIIIIIIIIIMIIIIIIIIIIIIII 

CTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAG 


517 
744 


Qy 


518 


CTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCC 

II MM II MM II Mill IIMMIMI 1 1 1 1 1 1 1 1 II 1 1 1 M 1 II 1 1 1 1 1 III 

CTGTACCAGAAATACAGCAACAATYCCTGGCGATACCTCAGCAACCGGCTGCTGGCACCC 


577 


Db 


743 


684 


Qy 


578 


AGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACC 

M Mill II MMM 1 M M 1 M M 1 1 1 1 1 1 1 1 II 1 M 1 1 1 M 1 1 1 II Ml 1 

AGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGC 


637 


Db 


683 


624 


Qy 


638 


CGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACTGTTCCTGTGACAGCAAAGAT 

II III II II II II Mil II MINIM 1 1 1 1 1 1 1 1 1 1 1 ! 1 III 

CGTGGAGGGGAAATTGA - GGCTTTCGCCTTAGCGCCCACTGCTCCTGTGACAGCAGGGAT 


697 


Db 


623 


565 


Qy 


698 


AACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACC 

II II Mill II M M II II M 1 1 MMIMI 1 1 1 II E 1 1 1 1 1 1 1 1 1 

AACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCCGCCGAGGTGACCTGGCCACC 


757 


Db 


564 


505 


Qy 


758 


ATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAG 

Mill llllllllllllll MMIMI Ml 1 1 1 1 M 1 II 1 1 1 1 M 1 1 1 1 1 1 1 Ml 1 

ATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAG 


817 


Db 


504 


445 


Qy 


818 


CACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCAACAGCTACCCATACGACGTG 
M Mill 1 1 1 f 1 1 1 J f I 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 i 

CATCTGCAAAGCTCCCGGCACCGCCGAGCCCTGGAC 


877 


Db 


444 


409 


Qy 


878 


CCAGACTACGCATCTCTGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 

MMIMI 1 1 1 1 III i 1 Ml 1 1 1 1 1 M 1 1 II 1 

ACCAACTATTGCTTCAGCTCCACGGAGAAGAAC 


937 


Db 


408 


376 


Qy 
Db 


938 
375 


TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 

MIMIIIMMIIMI MMIMMIMM MMIMI llllllllllllll II 

TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 


997 
316 


Qy 


998 


GAACCCAAGGGCTACCATGCCAATTTGTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 

II IIIIMMIIIIIIIIIMI 1:11 MMIMI II II M 1 1 MMIMI 

GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 


1057 


Db 


315 


256 


Qy 


1058 


GAC^CTC^GTACAGCTVAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 

Mill MMM: || 1 1| || | M II MIIIIMM III Mill IMM MMM 

GACA.CGCAGTAMMMCAAGGTCCTGGCCMTGTACAACCAGCATAACCC - GGCGCCTCGGCG 


1117 


Db 


255 


197 


Qy 


1118 


GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 
t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i ii 

1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 II 1 1 II 1 1 1 1 1 1 1 M II 1 II 1 1 II 1 1 M II M 1 M 

GCGCCGTNGT-CGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCTGC 


1177 


Db 


196 


138 


Qy 


1178 


AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTGCTGCAAGTGCAGCTGAGGC 

1 1 II II 1 1 M Mill IMI 1 II II Mil 1 II II 1 Ml , IIIIMMIIIIIIIIIMI 

AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGAGGT 


1237 


Db 


137 


78 


Qy 
Db 


1238 
77 


CCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGCCCCACCCCCGCCCGCCTCAC 

MIIMIMM II 1 M IIIIIIIIIMIIIIIIIIIMI MMM 1 

CCCGGCCCGCC CCGCCCCGCCCCGGCAGGCCCGGCCCCACCCCGCCCCGCCCCCG 


1297 
23 



Qy 12 98 CGGGGCTG 13 05 

I I II 
Db 22 CTGCCTTG 15 



RESULT 5 

BX324511/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



GI :30332381 



(human) 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Homimdae; Homo. 

and Polayes,D. 



BASE COUNT 
ORIGIN 



BX324511 1041 bp mRNA linear EST 02-MAY-2003 

BX324511 Homo sapiens NEUROBLASTOMA COT 2 5 -NORMALI ZED Homo sapiens 
cDNA clone CS0DC024YD20 3 - PRIME , mRNA sequence. 
BX324511 
BX324511. 1 
EST. 

Homo sapiens 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 1041) 
Li,W.B., Gruber,C, Jessee,J. 
Full-length cDNA libraries and normalization 
Unpublished 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160.r For 
more information about this cluster, see 
http : //www. genoscope . ens . f r/ 

cgi-bin/cluster . cgi?seq=CS0AC024DB10NP2&cluster=9160 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0AC024DB10NP2 . 

Location/Qualifiers 

1. .1041 

/organism="Homo sapiens" 
/mol_type=" mRNA 11 
/db_xref ="taxon: 9606" 
/clone="CS0DC024YD2 0" 

/tissue_type=" NEUROBLASTOMA COT 25 -NORMALIZED" 
/clone__lib="Homo sapiens NEUROBLASTOMA COT 2 5 -NORMALIZED" 
/note="lst strand cDNA was primed with a Not I -oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
176 a 283 c 346 g 219 t 17 others 



Query Match 47.7%; Score 649.2; DB 13; Length 1041; 

Best Local Similarity 80.6%; Pred. No. 1.4e-131; 

Matches 870; Conservative 10; Mismatches 133; Indels 67; Gaps 8; 

Qy 2 79 GGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCACCC 33 8 

= 11 I =1 I I I I I I I I I I' I I I II :h h I I I h ! I'M 

Db 1022 SGGGAGAKGSAGAACCGGAGCCCGACCCTRASCCR ACTAYACCCCAAGRAGTCACCC 966 



Qy 33 9 GCGTGCTAATGGTGGAAAGCGGC^CCAAATCTATGATAAATTC^AGGGCACCCCCCACA 3 98 



Db 965 GCGTGYTAATGKT -GAAACCCACAACGAAATCTATGAC7W3TTCAAGCAGAGTACACACA 907 

Qy 399 GCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTGC 4 58 

II llllllll I llllllll II llllllll llllllll I I lllll II MM 
Db 906 GCATATATATGTTCTTCAACACATCAGAGCTCCGAGAAGCGGTACCTGAACCCGTGTTGC 847 

Qy 459 TCTCTC-GGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAG 517 

I I I I I lllllllllllll I I I ' I I i ; I : I I I I I I I I I I I I MINI 

Db 846 TCTCCCSGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAG 787 

Qy 518 CTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCC 577 

I I I I I M I I I I I I I I I I I I llllllllll I I I I I I I I I I I I I I I I I I , I I Ml 

Db 786 CTGTACCAGAAATACAGCAACAATTCCTGGCGATACCTCAGCAACCGGCTGCTGGCACCC 727 

Qy 578 AGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACC 637 

I I Mill II MMM I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 M III I 

Db 726 AGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGC 667 

Qy 638 CGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACTGTTCCTGTGACAGCAAAGAT 697 

M III II II II II llllllll II llllllll lllllllllllll III 
Db 666 CGTGGAGGGGAAATTGA - GGCTTTCGCCTTAGCGCCCACTGCTCCTGTGACAGCAGGGAT 6 08 

Qy 698 AACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACC 757 

llllllll II Mill II MMMMM I I llllllll 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 607 AACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCCGCCGAGGTGACCTGGCCACC 54 8 

Qy 758 ATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAG 817 

iiiii 1 1 1 1 1 1 f I r 1 1 1 1 1 iiiiiiii 1 1 r 1 1 ,i i u 1 1, 1 1 r 1 1 1 1 1 1 1 

Db 54 7 ATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAG 4 88 

Qy 818 CACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCAACAGCTACCCATACGACGTG 877 

II lllll I I I I I I ' I I I I I I I I I I I I I i I I ! I 
Db 4 87 CATCTGCAAAGCTCCCGGCACCGCCGAGCCCTGGAC 4 52 

Qy 878 CCAGACTACGCATCTCTGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAAC 937 

iiiiiiii 1 1 r 1 1 r 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 

Db 4 51 ACCAACTATTGCTTCAGCTCCACGGAGAAGAAC 419 

Qy 938 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 997 

1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 ' 1 1 ; i MMM I MMMIMIMM II 

Db 418 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 359 

Qy 9 98 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1057 

II I M M I II II II I MM I llllllll llllllll llllllll llllllll 

Db 3 58 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 2 99 

Qy 1058 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1117 

Mill 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 F MIMIIIIIIIII Mill Mill MMM 

Db 2 98 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCC - GGCGCCTCGGCG 24 0 

Qy 1118 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1177 

llllllll II 1 1 1 1 I 1 1 1 M M II M i I 1 1 ; 1 1 1 1 ■ 1 1 . , , 1 1 1 1 1 1 1 1 1 : I I 

Db 23 9 GCGCCGTG - TGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 181 

Qy 1178 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGC 1237 

M M M 1 1 , 1 1 1 1 1 1 M 1 1 ' 1 1 M 1 1 llllllllll 1 I MMM I MMM . I M 1 1 



Db 18 0 AAGCCC^GGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTG(^AGTGCAGCTGAGGT 121 

Qy 12 3 8 CCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGCCCCACCCCCGCCCGCCT 1294 

llllllllllll II I II I I I Mill 1 1 1 1 MM III II 

Db 12 0 CCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCACCCCGCCCCGCCCCCGCTGCC 61 

Qy 12 95 CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAGCCCACTTGGGATCGATTAAA 134 9 

II I I I I I I I I E I I I I I I I I j I I llllllllllll MM I : II 

Db 60 TTGCCCATGGGGGCTGTATTTAAGGACACCCGTCCCCAAGCCCACCTGGSCACCCAYTAA 1 



RESULT 6 

BX434425/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



BX434425 888 bp mRNA linear EST 15-MAY-2003 

BX434425 Homo sapiens PLACENTA Homo sapiens cDNA clone CS0DE014YE16 
3 -PRIME, mRNA sequence. 
BX434425 

BX43442 5 . 1 GI : 3 07792 91 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 888) 

Li,W.B. ; Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 
http : //www. genoscope . ens . f r/ 

cgi-bin/cluster . cgi?seq=CSOBAK028AB08NMl&cluster=9160 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0BAK028AB08NM1 . 

Locat ion/Qua 1 i f iers 

1. .888 

/organism="Homo sapiens " 

/mol_type="mRNA" 

/db_xref ="taxon: 9606" 

/clone=" CS0DE014YE16" 

/ 1 i s sue_t ype = " PLACENTA 11 

/clone_lib="Homo sapiens PLACENTA" 

/note= "Vector: pCMVSP0RT_6; 1st strand cDNA was primed 
with a NotI -oligo (dT) primer. Five prime end enriched, 
double-strand cDNA was digested with Not I and cloned into 
the Not I and EcoRV sites of the pCMVSPORT 6 vector. 
Library was not normalized." 
156 a 253 c 302 g 176 t 1 others 



Query Match 44.8%; Score 610.4; DB 13; Length 888; 

Best Local Similarity 82.6%; Pred. No. 4e-123; 



Matches 772; Conservative 0; Mismatches 102; indels 61; Gaps 



Qy 


428 


Db 


888 


Qy 


488 


Db 


829 


Qy 


548 


Db 


769 


Qy 


608 


Db 


709 


Qy 


668 


Db 


649 


Qy 


728 


Db 


589 


Qy 


788 


Db 


529 


Qy 


848 


Db 


472 


Qy 


908 


Db 


460 


Qy 


968 


Db 


400 


Qy 


1028 


Db 


340 


Qy 


1088 


Db 


280 


Qy 


1148 


Db 


220 


Qy 


1208 


Db 


160 



42 8 CTC CGGGAAG CGGTG C CGGAACCTGTATTG CTCTCT CGGG CAGAG CTG CG CCTG CTGAGG 487 

MM MMMI M IMM I Ml I I M I M M M I M I MIMM I 

8 88 CTCCAAAAAGCGGTACCTGAACCCTTGTTGTCTNCCCGGGCAGAGCTGCGTCTGCTGA-G 83 0 
CTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGG 54 7 

MMMM Ml IMMMMMMMMM MMMMMMMMI MMMM 

CTCAAGTTTAAAATGGAGCAGCACGTGGAGCTGTACC^GAAATAC^GCAAC^TTCCTGG 770 
CGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGAT 607 

1 1 MMMMMIMMMMMM I Ml MM! M MMM I M MMM 

CGATACCTCAGCAACCGGCTGCTGGCACACAGCGACTCGCCAGAGTGGTTATCTTTTGAT 710 
608 GTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTC 667 

MMMMMIMMMMMMI IM Ml Ml M II IMM MMMM 

709 GTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTT 65 0 
AGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTC 72 7 

M MMMM MMMMMMI MMMMMI M MMI M MMMMI 

AGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTC 590 
AATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTC 787 

I I I MMMM IMMMMMMMMMI MMMMMMM MMMM 

ACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTT 53 0 
CTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCC 84 7 

MMMMMMMMMMMMMMMM MMI MMMMMMMMI 



CTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATACC 907 

MMMM III 

GCCCTGGACACC 461 

AACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTC 967 

Mill 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 f 1 1 1 1 1 1 i 1 1 1 1 1 1 1 

AACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTC 4 01 
CGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGC 1027 

II MMMM MMMMMMM M II 1 1 II II 1 1 1 M M M I II II MMM 

CGC^AGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGC 341 
CTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTG 1087 

II MMMM llllllll llllllll Mill MMMMMMMIIMM Ml 

CTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTG 281 
TACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAG 114 7 

MMMMMI II 1 1 1 II 1 1 II I M I U , 1 1 1 1 II 1 1 1 1 1 1 1 1 ; 1 1 1 1 1 ! I M ! 1 1 1 ! 

TACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAG 221 
CCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCC^GGTGGAGCAGCTGTCCAACATG 12 07 

II 1 1 1 II M 1 1 1 1 II M II 1 1 M 1 1 1 1 1 II I II 1 1 1 1 II I II 1 1 M 1 1 1 II 1 1 1 M 1 1 1 

CCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATG 161 
ATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGG 1267 

MMMM I II I II 1 1 1 1 1 1 1 1 II II II MIMMIMM II I II I I 

ATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGC 101 



Qy 1268 CAGGCCCGGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA-T 1318 

i him mi 1 1 1 1 mi ii ii r 1 1 1 j 1 1 1 1 1 1 1 1 1 f i r 1 1 

Db 100 CCGGCCCCACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACC 41 

Qy 1319 CGTGCCCCAAGCCCACTTGGGATCGATTAAAGCGG 1353 

I I I I I I I I I I ill I I, I I I I I II II I II II 
Db 4 0 CGTGCCCCAAGCCCACCTGGGCCCCATTAAAGAGG 6 



RESULT 7 
BM555996 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



BM555996 1043 bp mRNA linear EST 20-FEB-2002 

AGENCOURT_65444 3 7 NIH_MGC_8 8 Homo sapiens cDNA clone IMAGE : 5550039 
5 ' , mRNA sequence. 
BM555996 

BM555996 .1 GI : 187 96907 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 1043) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpubl ished 

Contact: Robert Strausberg, Ph.D. 

Email : cgapbs-r@mail .nih.gov 

Tissue Procurement: ATCC 
cDNA Library Preparation: Life Technologies, Inc. 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consort ium/LLNL at: 

http : / /image . llnl . gov 

Plate: LLAM12261 row: i column: 16 

High quality sequence stop: 702. 
Locat ion/Qua 1 if iers 
1. .1043 

/organism="Homo sapiens" 
/mol_type= "mRNA" 
/db_xref ="taxon: 9606" 
/clone=" IMAGE: 555 003 9" 

/tissue__type=" duodenal adenocarcinoma, cell line" 
/lab_host="DH10B (phage-resistant ) " 
/clone_lib="NIH_MGC_8 8" 

/note="Organ: small intestine; Vector: pCMV-SP0RT6; 
Site 1: Not I; Site_2: Sail; Cloned unidirectionally ; 
oligo-dT primed. Average insert size 1.767 kb. Library 
enriched for full-length clones and constructed by Life 
Technologies. Note: this is a NIH_MGC Library." 
202 a 329 c 345 g 154 t 13 others 



Query Match 44 . 8%; 

Best Local Similarity 81.1%; 
Matches 736; Conservative 



Score 610.2; DB 12; Length 1043; 
Pred. No. 4,7e-123; 
0; Mismatches 120; Indels 51; Gaps 



1; 



Qy 3 96 ACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTAT 4 55 

Mill llllllll I llllllll II llllllll llllllll II Mill M I 
Db 1 ACAGO^TATATATGTTCTTC^CACATCAGAGCTCCGAGAAGCGGTACCTGAACCCGTGT 60 

Qy 4 56 TG CT CTCTCGGGCAGAG CTG CG CCTG CTGAGG CT CAAGTTAAAAGTGGAG CAG CACGTGG 515 

lllllll llllllllllllll : I i i I : I ' I I I I ' I I I I I I ! I II III II I I ! I 

Db 61 TG CTCT CCCGGGCAGAG CTG CGT CTG CTG AGGCT CAAGTTAAAAGTGGAG CAG CACGTGG 12 0 

Qy 516 AGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCC 57 5 

I I I I Mlllllllllllllll Illlllllll M I I I I I I I ! I I I I I I I I I I I I | 
Db 121 AGCTGTACCAGAAATACAGCAACAATTCCTGGCGATACCTCAGCAACCGGCTGCTGGCAC 18 0 

Qy 576 CCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGA 635 

1 1 1 1 Mill II MMM I II MIMMMMMMMMIMMIIMM III 

Db 181 CCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGA 24 0 

Qy 63 6 CCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACTGTTCCTGTGACAGCAAAG 6 9 5 

III III II II Mill llllllll II IIIMIII lllllll Mill I 

Db 241 G CCGTGGAGGGGAAATTGAGGG CTTT CG C CTTAG CG CC CA CTG CT CCTG TGA CAG CAGGG 3 00 

Qy 696 ATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCA 755 

MIMIIMI II Mill II MMMMM I I MINIM lllllllllllll 

Db 3 01 ATAACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCCGCCGAGGTGACCTGGCCA 360 

Qy 756 CCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCC 815 

lllllll IMIIIIIMIIII IIIMIII IMMMMMMMMMMMIMM 

Db 3 61 CCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCC 42 0 

Qy 816 AGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCAACAGCTACCCATACGACG 8 75 

MM Mill 1 1 1 1 1 1 1 1 i 1 1 M ! j ; I i 

Db 421 AGCATCTGCAAAGCTCCCGGCACCGCCGA 44 9 

Qy 876 TGCCAGACTACGCATCTCTGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGA 935 

llllllll llllllll MIMII lllllllllllll 

Db 4 50 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGA 48 9 

Qy 936 ACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTC 995 

1 1 II II M 1 1 1 M II 1 1 M IMIIIIMIIIII IIIMIII llllllllllllll I 

Db 4 90 ACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCC 54 9 

Qy 9 96 ATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCC 1055 

I II IIMIIIIIIIIIIIIIIII llllllll llllllll llllllll lllllll 

Db 55 0 ACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCC 609 



Qy 


1056 


TAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGG 

1 Mill MIIIIIIIIIIIMIIIM llllllllllllll MIMIIMM MM 

TGGACACGC^GTACAGC^GGTCCTGGCCCTGTAa^CCAGCATAACCCGGGCGCCTCGG 


1115 


Db 


610 


669 


Qy 


1116 


CGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCC 

1 1 1 1 II M 1 II M II 1 1 II 1 1 II 1 1 II 1 1 II 1 II MIMMIMMMMMMIMM 
CGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCC 


1175 


Db 


670 


729 


Qy 

Db 


1176 
730 


GCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGAG 

M II II 1 1 1 1 II II 1 M 1 II 1 II 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 lllllll III 

GCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAGTGCACCTTGAA 


1235 
789 



Qy 1236 GCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGCCCCACCCCCGCCCGCCTC 12 95 

I I II II I I II III I II I II II I 

Db 790 GGGTCCGGCCCCCACNCGGGNCNNNNNGGNNNNGGGGGGNCCCCGGCGCCGGGCCCCCGC 84 9 

Qy 12 96 ACCGGGG 13 02 

I II 

Db 8 50 GGCCCGG 856 
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BX383773/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
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AUTHORS 
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JOURNAL 
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ORIGIN 



BX383773 950 bp mRNA linear EST 08-MAY-2003 

BX383773 Homo sapiens HELA CELLS COT 25 -NORMALIZED Homo sapiens 
cDNA clone CS0DK001YA15 3-PRIME, mRNA sequence. 
BX383773 

BX383773 . 1 GI -.30457168 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 950) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 
http : / /www. genoscope. ens . f r/ 

cgi-bm/cluster . cgi?seq=CS0DK001AA08NPl&cluster=9160 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0DK001AA08NP1 . 

Locat ion/Qua 1 if iers 

1. .950 

/organi sm= " Homo sap i ens " 
/mol_type="mRNA" 
/db_xref ="taxon: 9606" 
/clone=" CS0DK001YA15" 

/cell_type="HELA CELLS COT 2 5 -NORMALIZED" 
/eel l__line=" HELA" 

/clone_lib="Homo sapiens HELA CELLS COT 2 5 -NORMALIZED" 
/note=" 1st strand cDNA was primed with a Not I -oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
171 a 260 c 312 g 195 t 12 others 



Query Match 44.3%; 
Best Local Similarity 79.4%; 
Matches 791; Conservative 



Score 603.2; DB 13; Length 950; 
Pred. No. 1.5e-121; 
8; Mismatches 134; Indels 63; Gaps 5; 



Qy 


353 


GAAAGCGGCAACCAAATCTATGATAAATTO^GGGCACCCCCCACAGCTTATATATGCTG 

1 = 11 1 MM MMMMM M MMM 1 = 1 MMM MMMM 1 

GRAACCC^CAACGAAATCTATGACAAGTTCAAGC^GAGMACACACAGCATATATATGTTC 


412 


Db 


943 


884 


Qy 


413 


TTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAG 

IMMMI 1 1 MMMM MMMM M MMI M MMMM MMMMI 

TTCAAMACATCAGAGCTCCGAGAAGCGGTACCTGVACCCGTGTTGCTCTCCCGGGCAGAG 


472 


Db 


883 


824 


Qy 


473 


CTGCGCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATAC 


532 


Db 


823 


MMI MMMMMMMMMMMMIMMMMMMMM MMMMMM 

CTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATAC 


764 


Qy 


533 


AGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAG 

MMI M MMMI MMMMMMMMMMMI IMM MMI II III 

AGCAACAATAMCTGGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAG 


592 


Db 


763 


704 


Qy 


593 


TGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATA 

im i ii 1 1 1 1 1 1 1 1 1 1 1 1 f r 1 1 1 1 1 1 1 r i r 1 1 1 1 1 1 mi mi mi ii ii 

TGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATT 


652 


Db 


703 


644 


Qy 


653 


GAGGGTTTTCGCCTCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTG 


712 


Db 


643 


M 1 1 IMMMI II MMMM IMM! llllll III IMMM II III 

GA-GGCTTTCGCCTTAGCGCCCAGTGCTCCTGTTACAGCAGGGATTACACACTGCAAGTG 


585 


Qy 


713 


GAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAAC 

II M IMMMI 1 1 1 IMMMI 1 M 1 1 1 1 1 1 1 1 II II 1 1 1 II MMMMI 

GACATCAACGGGTTTACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAAC 


772 


Db 


584 


525 


Qy 


773 


CGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCC 

Mill llllllll 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIIM llllll 
CGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCC 


832 


Db 


524 


465 


Qy 


833 


CGGCACCGCCGAGCCCTGGATACCAACAGCTACCCATACGACGTGCCAGACTACGCATCT 


892 


Db 


464 


MMMMMM 

CGGCACCGCCGA 


453 


Qy 
Db 


893 
452 


CTGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAG 

Ihlllll llllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
- - -GCMCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAG 


952 
396 


Qy 


953 


CTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTAC 

M lllllhlllllll IIIIIIM IMIIIIIMIMI II II MMMMMM 

CTGTACATTKACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTAC 


1012 


Db 


395 


336 


Qy 
Db 


1013 
335 


CATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGC 

MMM 1 IMMMI MMMM MMMM MMMM Mill MMMMI 

CATGCCGACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGC 


1072 
276 


Qy 


1073 


AAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTG 

MIMIIMM 1 II III Mill 1 1 1 f 1 1 1 1 1 1 1 1 1 1 llllll 

AAGGTCCTGGCGCCTGTACAACCAGCATAACCCGGCGCCTCGGCGGCGCCGTG-TGCGTG 


1132 


Db 


275 


217 


Qy 


1133 


CCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAG 

1 1 II II 1 II M 1 1 M 1 1 IIIIMMMMMMIMIMM ' 1 1 1 1 1 1 1 1 ! 1 1 M 1 1 

CCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCKKCAAGCCCAAGGTGGAG 


1192 


Db 


216 


157 


Qy 


1193 


CAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACA 


1252 



Db 

Qy 
Db 

Qy 
Db 



156 



IMIIIIII IIIIIIIMIIII IMIIIIIIMIIIIMII MINIMUM 

CAGCTGTCCNACATGATCGTGCGCTCCTGCAAGTGCAGCTGAGATCCCGCCCCGCCCCGC 



97 



1253 GCCCCGCCCACCCGGCAGGCCCGGCCCCACCCCCGCCCGC CTCACCGGGGC 1303 

M I M I I Ml Ml MINIM I I M Mill 

96 CCCGCCCCGGAAGGCCGCGGCCCCACCCCGCCCCGCCCCCSCTGCCTTGCCCATGGGGGC 37 

1304 TGTATTTAAGGA - CATCGTGCCCCAAGCCCACTTGG 1338 

IIIIIIIMI II I II NIMNNII III 
36 TGTATTTAAGNACCCCCCTGGCCCAAGCCCACCTGG 1 
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AL530081 859 bp mRNA linear EST 23-MAY-2003 

AL530081 Homo sapiens NEUROBLASTOMA COT 50 -NORMALIZED Homo sapiens 
cDNA clone CSODD009YM06 5-PRIME, mRNA sequence. 
AL530081 

AL53 0081.2 GI : 31067916 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 859) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 

On Feb 13, 2001 this sequence version replaced gi: 12793574. 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 

was normalized. Library was constructed by Life Technologies, a 

division of Invitrogen. This sequence belongs to sequence cluster 

9160. r For more information about this cluster, see 

http : //www. genoscope . ens . f r/ 

cgi -bin/cluster . cgi?seq=CS0DD0 0 9BG03QPl&cluster=916 0 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0DD009BG03QP1 . 

Locat lon/Qual i f iers 

1. .859 

/organ ism= "Homo sapiens" 
/ mo 1 _t yp e = " mRNA " 
/db_xref ="taxon: 9606" 
/ c 1 one= " CS ODDO 0 9 YMO 6 " 

/tissue type= "NEUROBLASTOMA COT 50 -NORMAL I ZED" 
/clone_lib="Homo sapiens NEUROBLASTOMA COT 5 0 -NORMALIZED" 
/note="lst strand cDNA was primed with a Not I -oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
179 a 272 c 257 g 148 t 3 others 



Query Match 



42.7%; Score 580.6; DB 9; Length 859; 



Best Local Similarity 87.3%; Pred. No. 1.3e-116; 

Matches 645; Conservative 2; Mismatches 91; Indels 1; Gaps 1; 



Qy 


6 


Db 


121 


Qy 


66 


Db 


181 


Qy 


126 


Db 


241 


Qy 


186 


Db 


301 


Qy 


246 


Db 


361 


Qy 


306 


Db 


421 


Qy 


366 


Db 


481 


Qy 


426 


Db 


541 


Qy 


486 


Db 


601 


Qy 


546 


Db 


661 


Qy 


605 


Db 


721 


Qy 


665 


Db 


781 


Qy 


725 


Db 


841 



CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 

II 1 1 1 1 1 1 1 II lllllllllll III MINIMI 1 1 1 1 1 1 1 1 1 1 M 1 1 I 

CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCTGCTGCTGCTACCGCTGCTGTGGCTAC 18 0 
TAGTG CTGACG CCTGG C CGG CCGGC CG C CGGA CTGT C CA C CTG CAAGAC CAT CGACATGG 12 5 

I .1111:111111 I. MM I I Mill II II 

TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 24 0 
AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 

MIM Ml! MIMMMMI MM 1 1 1 1 1 1 1 lllllllllll MM 

AG CTGGTGAAG CGGAAG CG CATCGAGG CCAT CCG CGG C CAGAT CCTGT CCAAGCTG CGG C 3 00 
TTG C CAG C CC C C CGAG CCAGGGGGACGTG CCG CC CGG C CCG CTG CCTGAGG CAGTACTGG 24 5 

I I Mill 1 1 1 1 1 1 1 II I M I II! MIM Mill II II I 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 360 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II MM MM MM MM MM 1 1 1 II 1 1 II 1 1 II 1 1 1 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 4 2 0 
CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

I Mill 1 1 1 1 1 1 M 1 1 1 1 1 M M 1 1 1 1 Ml II 1 1 MM II ' 1 1 1 1 1 : I MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 4 8 0 
AAATCTATGATAAATTC^GGGC^CCCCCCA^GCTTATATATGCTGTTO^CACGTCGG 4 25 

M II 1 1 II II II MM I I MMM MIM I MM II I 

AAATCTATGACAAGTTCAAGCAGAGTACACACAGC^ 54 0 

AG CT CCGGGAAG CGGTG CCGGAA CCTGTATTG CTCT CT CGGG CAGAG CTG CG C CTG CTGA 4 8 5 

MM MIM II MIM II MMM IIIIIIIMIIMI MIM 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 60 0 
GGCTC^AGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

1 1 II 1 1 1 1 1 1 1 II I M 1 1 1 1 1 II II 1 1 1 1 1 II II II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 MMM 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAA 660 
GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTG - GCTGTCCTTT 604 

MM 1 1 MM III II 1 1 M I M 1 1 1 II Mill Mill M Mill I I M III 

GGCGATACCTC^GCAACCGG CTG CTGGCACC CAG CGACTCGC CAGAG TGTGTTATCTTTT 72 0 
GATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGC 664 

M II II 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 M III Ml III II II Mill Mill 

GATGTCACCGGAGTTGTGCGGCAGTKGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGC 780 
CTCAGTGCCC^CTGTTCCTGTGAC^GO^GATAAC^C^CTCC^CGTGGAAATTAACGGG 724 

M II MMM 1 1 1 1 1 1 1 ! 1 1 1 1 1 IIIIIIIIMI M MM II MMM 

CTTAGCGCCC^CTGCTCCTGTGACAGCAGGGATAACAC^CTGC^GTGRACATa^CGGG 84 0 



MM 



RESULT 10 



BE312000 

LOCUS BE312000 713 bp mRNA linear EST 26-OCT-2000 

DEFINITION 601154768F1 NIH_MGC_19 Homo sapiens cDNA clone IMAGE : 351 0592 5', 

mRNA sequence. 
ACCESSION BE312000 
VERSION BE312000.1 GI: 9130128 

KEYWORDS EST . 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 713) 

AUTHORS NIH-MGC http://mgc.nci.nih.gov/. 

TITLE National Institutes of Health, Mammalian Gene Collection (MGC) 

JOURNAL Unpubl i shed 
COMMENT Contact: Robert Strausberg, Ph.D. 

Email : cgapbs-r@mail .nih.gov 
Tissue Procurement: ATCC 
cDNA Library Preparation: Ling Hong/Rubin Laboratory 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: image.llnl.gov 
Plate: LLCM195 row: h column: 17 
High quality sequence start: 2 
High quality sequence stop: 713. 
FEATURES Loca t i on/ Qua 1 i f i er s 

source 1. .713 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db__xref = "taxon: 9606" 

/clones "IMAGE: 35105 92" 

/ 1 i s sue_t ype= " neurobl a s t oma " 

/lab_host="DH10B (phage-resistant ) " 

/clone_l ib= "NI H__MGC_1 9 " 

/not e=" Organ: brain; Vector: pOTB7; Site_l : Xhol ; Site_2: 
EcoRI; cDNA made by oligo-dT priming. Directional ly 
cloned into EcoRI/XhoI sites using the following 5' 
adaptor: GGCACGAG (G) . Library constructed by Ling Hong 
in the laboratory of Gerald M. Rubin (University of 
California, Berkeley) using ZAP-cDNA synthesis kit 
(Stratagene) and Superscript II RT (Life Technologies) . 
Note: this is a NIH_MGC Library." 

BASE COUNT 157 a 226 c 211 g " 119 t 

ORIGIN 



Query Match 42.2%; Score 573.8; DB 10; Length 713; 

Best Local Similarity 87.8%; Pred . No. 3.7e-115; 

Matches 626; Conservative 0; Mismatches 87; Indels 0; Gaps 0, 
Qy 135 AGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCC 194 

Illlllllllllllllllllll IIIIIIIIIM IIIIIIIMII Mill lilllll 

Db 1 AGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCCAGCC 60 



Qy 195 CCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACA 254 

MMMMMIIMM 1 1 1 ! I j 1 1 1 1 1 1 ' 1 , 1 ' I Mill M II II II MM 

Db 61 CCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTGTACA 12 0 



Qy 


255 


Db 


121 


Qy 


315 


Db 


181 


Qy 


375 


Db 


241 


Qy 


435 


Db 


301 


Qy 


495 


Db 


361 


Qy 


555 


Db 


421 


Qy 


615 


Db 


481 


Qy 


675 


Db 


541 


Qy 


735 


Db 


601 


Qy 


795 


Db 


661 



ACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGG 

1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 MINIM MM 1 1 1 M 1 1 1 1 1 1 1 1 1 ! 1 1 MIM I 

ACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGCCG 
ACTACTACGCCAAGGAGGTCACCCGGGTGCTAATGGTGGAAAGCGGCAACCAAATCTATG 

1 1 M 1 1 1 1 1 1 1 1 1 1 1 II I II 1 1 1 II 1 1 1 1 1 1 1 1 1 II I M I M I MM MINIMI 

ACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTATG 
ATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGG 

I II MINI I I NUN MINIM I MIM! II MINIM I 

ACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTCCGAG 
AAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGT 

ININI II Mill II IIIIIMI 1 : 1 . 1 1 1 1 1 i 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGT 
TAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACC 

I I M 1 1 1 Ml 1 1 INI 1 1 1 IN M I I M M I MM I Ml II I 1 1 1 1 1 1 1 1 1 1 I M I 

TAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGATACC 



Ml 



II II 



INN II MM I N NIINNINN 



I I 1 1 1 ! 1 1 1 1 1 ! 1 1 . Ill IN IN II II INN I 



III 



II 



II II 



I II 



314 



180 



374 



240 



434 



300 



494 



360 



554 



420 



GCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGG 

MINN II II MM 1 1 MM 1 1 1, M NINNNNIN Mill 1 MINN 

GCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGG 
CCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCC 84 7 

1 1 1 1 1 M I II I ! MM 1 1 MM M I INN ININI I I WWW 

CCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCCGGCACCGCGAGCC 713 



794 



660 



RESULT 11 

BE260971 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 



BE260971 717 bp mRNA linear EST 26-OCT-2000 

601153715F1 NIH_MGC_19 Homo sapiens cDNA clone IMAGE : 3509 93 1 5', 
mRNA sequence. 
BE260971 

BE260971 . 1 GI :9132709 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 717) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



JOURNAL 
COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



Unpubl ished 

Contact: Robert Strausberg, Ph.D. 

Email : cgapbs - r@ma i 1 .nih.gov 

Tissue Procurement: ATCC 
cDNA Library Preparation: Ling Hong/Rubin Laboratory 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort lum/LLNL at: image.llnl.gov 
Plate: LLCM193 row. m column: 04 
High quality sequence stop: 713. 

Locat ion/Qua 1 i f iers 

1. .717 

/organism= "Homo sapiens" 

/mol_type= "mRNA" 

/db_xref="taxon: 9606" 

/ clone=" IMAGE : 3509931" 

/t issue_type= "neuroblastoma " 

/lab_host="DH10B (phage-resistant) " 

/clone_lib="NIH_MGC_19" 

/note="Organ: brain; Vector: pOTB7; Site_l : Xhol; Site_2: 
EcoRI ; cDNA made by oligo-dT priming. Direct ionally 
cloned into EcoRI/XhoI sites using the following 5' 
adaptor: GGCACGAG ( G ) . Library constructed by Ling Hong 
in the laboratory of Gerald M. Rubin (University of 
California, Berkeley) using ZAP-cDNA synthesis kit 
(Stratagene) and Superscript II RT (Life Technologies) . 
Note: this is a NIH_MGC Library." 
158 a 226 c 213 g 120 t 



Query Match 41.6%; Score 565.8; DB 10; Length 717; 

Best Local Similarity 87.7%; Pred . No. 2.1e-113; 

Matches 629; Conservative 0; Mismatches 87; Indels 1; 



Gaps 



1; 



Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 



136 GCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCC 195 

1 1 1 II I II I . I . I Ml , II 1 1 I MUM MMMMMI MMI MMMM 

1 GCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCCAGCCC 60 
196 CCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAA 2 55 

MMMMMI Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill II M M II Mill 

61 CCCGAGCCAGG-GGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTGTACAA 119 
2 56 CAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGGA 315 

Ml IIIIMIIIIIMI MMMM MM 1 1 II 1 1 II II 1 1 1 1 1 II Mill II 

120 CAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGCCGA 17 9 
316 CTA CTACG C CAAGG AGGTCA C C CG CGTG CTAATGGTGG AAAG CGG CAAC CAAAT CTATGA 375 

1 1 M I M M I M M 1 1 M I M I M M M M M M I M M 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

180 CTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTATGA 23 9 
376 TAAATTC^GGGCACCCCC(^(^GCTTATATATGCTGTTC^CACGTCGGAGCTCCGGGA 435 

II MM I I MUM MMMM I MMMM M MMMM II 

24 0 CAAGTTCAAGC^GAGTACACACA^ 2 99 

436 AGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTT 4 95 

MUM II Mill II MMMM 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II II 1 1 



Db 



3 00 AGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTT 35 9 



Qy 4 96 AAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCT 55 5 

IIIIIIIIIIIIIIIIIIIIIIII lllllllllllllllll llllllllll Mill 

Db 360 AAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGATACCT 419 

Qy 556 CAGCAACCGGCTGCTGGCCGCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGG 615 

IMIIIIIIIIIIIIIII Mill Mill II Mllll I M IMIIIMIIIIII 

Db 42 0 CAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGG 47 9 

Qy 616 AGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCA 675 

F 1 1 1 1 1 1 1 r 1 1 1 1 1 ] I Ml Ml Ml II II Mill llllllll II Mill 

Db 48 0 AGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTGGCCTTAGCGCCCA 53 9 

Qy 676 CTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGG 73 5 

Ml MIMMIMIM IIIIIIIIIM II Mill II llllllllll I I M 

Db 54 0 CTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACTACCGG 59 9 

Qy 736 CCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGC 79 5 

miiii i : 1 1 1 1 1 1 1 1 1 1 1 ^ 1 1 1 1 MiiMiiiiiiii 1 1 1 1 1 1 1 1 iiiiiiii 

Db 6 00 CCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGC 65 9 

Qy 7 96 CACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGA 852 

IIIIIIIIIIIIIIIIIIIIIIII Mill IIIIIIIIIIIIIII II III 

Db 660 CACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCGAGCCCTGGGA 716 



RESULT 12 

BQ675698 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BQ675698 902 bp mRNA linear EST 15-JUL-2002 

AGENCOURT_8036532 NIH_MGC_102 Homo sapiens cDNA clone IMAGE : 62 11917 
5', mRNA sequence. 
BQ675698 

BQ675698.1 GI:21786532 
EST . 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 902) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpubl ished 

Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: ATCC 
cDNA Library Preparation: Rubin Laboratory 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consort ium/LLNL at: 

http : //image . llnl . gov 

Plate: LLCM2375 row: c column: 22 

High quality sequence stop: 599. 
Loca t ion/Qua 1 i f i er s 
1. .902 

/organism="Homo sapiens" 



/ mo 1 _ t yp e = " mRN A " 
/db_xref ="taxon: 9606" 
/clone=" IMAGE: 6211917" 

/tissue_type="epidermoid carcinoma, cell line" 
/lab_host="DH10B (phage-resistant ) " 
/clone_l ib= "NIH__MGC_1 02 " 

/note= " Organ : salivary gland; Vector: pOTB7; Site_l: Xhol ; 
Site_2: EcoRI; cDNA made by oligo-dT priming. 
Directionally cloned into EcoRI/XhoI sites using the 
following 5' adaptor: GGCACGAG(G). Library constructed 
by Ling Hong in the laboratory of Gerald M. Rubin 
(University of California, Berkeley) using ZAP-cDNA 
synthesis kit (Stratagene) and Superscript II RT (Life 
Technologies). Note: this is a NIH_MGC Library." 

BASE COUNT 194 a 292 c 255 g 161 t 

ORIGIN 

Query Match 41.5%; Score 565.2; DB 13; Length 902; 

Best Local Similarity 87.0%; Pred. No. 3.1e-113; 

Matches 621; Conservative 0; Mismatches 93; Indels 0; Gaps 0; 
Qy 156 TTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGC 215 

i Minimi mi iiiii 1 1 1 1 1 1 1 1 1 1 1 1 e 1 1 1 1 1 r 1 1 1 r mm 

Db 1 TCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCCAGCCCCCCGAGCCAGGGGGAGGTGC 6 0 

Qy 216 CGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAG 275 

M III III III Mill M II II M 1 1 1 - 1 1 1 1 MIMMMIMM I 

Db 61 CGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTGTACAACAGCACCCGCGACCGGGTGG 12 0 

Qy 276 CCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCA 33 5 

1 1 E I E 1 1 MM I ! 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 Mill 1 1 i 1 1 1 1 1 : 1 1 1 1 ! 1 1 : 1 ! I 

Db 121 CCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGCCGACTACTACGCCAAGGAGGTCA 18 0 

Qy 336 CCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGATAAATTCAAGGGCACCCCCC 3 95 

1 1 1 : M 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 I MM IMIIMIM II MUM I I I 

Db 181 CCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTATGACAAGT^ 24 0 

Qy 3 96 ACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTAT 4 55 

IIIII I Ml I I 1 1 1 i 1 1 1 1 II 1 1 1 ; I M I I II II Mill II I 

Db 241 ACAGCATATATATGTTCTTCAACACATCAGAGCTCCGAGAAGCGGTACCTGAACCCGTGT 3 00 

Qy 4 56 TGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGG 515 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II III M ll 1 1 1 1 1 MM 1 1 1 1 1 1 M Ml 1 1 M 1 1 

Db 3 01 TG CTCTC C CGGG CAGAG CTG CGT CTG CTGAGG CT CAAGTTAAAAGTGGAG CAG CACGTGG 360 

Qy 516 AGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCC 575 

MM 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 MMIIIMI M 1 1 MM 1 1 1 M M 1 1 II 1 1 1 I 

Db 3 61 AGCTGTAC^GAAATAC^GC^O^TTCCTGGCGATACCTCAGCAACCGGCTGCTGGCAC 42 0 

Qy 576 CCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGA 63 5 

MM Mill II MM I II I II 1 I ' I I ' Ml 

Db 421 CCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGA 480 

Qy 636 CCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACTGTTCCTGTGACAGCAAAG 695 

Ml Ml II II Mill MM II MMIIII 1 1 1 ! 1 1 1 1 1 ! 1 1 i I 

Db 481 GCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGCGCCCACTGCTCCTGTGACAGCAGGG 54 0 



Qy 696 ATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCA 755 

llllllllll II Mill II I'll I I MINIM 1 1 1 1 1 1 1 M 1 1 1 1 

Db 541 ATAACACACTGCAAGTGGACATCAACGGGTTCACTACCGGGCGCCGAGGTGACCTGGCCA 600 

Qy 756 CCATTCACGGCATGAACCGGCCCTTCGTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCC 815 

11)1111 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 601 CCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCC 66 0 

Qy 816 AGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCAACAGCTACCCAT 869 

MM MM I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I 

Db 661 AGCATCTGCCAAGCTCCCGGCACCGCCGAGCCCTGGGACACCACCTATTGCCTT 714 



RESULT 13 

BI818841 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BI818841 925 bp mRNA linear EST 04-OCT-2001 

603037307F1 NIH_MGC_115 Homo sapiens cDNA clone IMAGE : 5178433 5', 
mRNA sequence. 
BI818841 

BI818841 . 1 GI : 15930391 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini ; Hominidae; Homo. 

1 (bases 1 to 925) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 

Email : cgapbs-r@mail . nih . gov 

Tissue Procurement: Life Technologies, Inc. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
http : / / image . 1 lnl . gov 
Plate: LLAM11444 row: n column: 02 
High quality sequence stop: 874. 

Locat ion/Qua 1 if iers 

1. .925 

/organism="Homo sapiens" 
/mol__type= n mRNA" 
/db_xref ="taxon: 9606" 
/clone=" IMAGE: 5178433" 
/lab_host="DH10B" 
/clone lib="NIH _MGC_115" 

/note="Organ: pooled brain, lung, testis; Vector: 
pCMV-SPORT6; Site_l : Not I; Site_2: EcoRV (destroyed); RNA 
source anonymous pool of 6 male brains, age range 23-27; 1 
male lung, age 27; and 1 male testis, age 69. Library is 
oligo-dT primed and direct ionally cloned (EcoRV site is 
destroyed upon cloning) . Average insert size 1.8 kb, 
insert size range 1-3 kb . Library is normalized and 
enriched for full-length clones and was constructed by C. 
Gruber (Invitrogen) . Research Genetics tracking code 



021. Note: this is a NIH_MGC Library." 
BASE COUNT 170 a 316 c 278 g 161 t 

ORIGIN 



Query Match 39.5%; Score 537.6; DB 12; Length 925; 

Best Local Similarity 87.5%; Pred. No. 3.3e-107; 

Matches 611; Conservative 0; Mismatches 84; Indels 3; Gaps 2; 



Qy 


6 


CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 


65 


Db 


146 


II III MM II lllllllllll II 11:1111111 llllllllllllll 1 

CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 


205 


Qy 


66 


TAGTG CTGACGC CTGG C CGG CCGGCCGC CGGA CTGTC CACCTG CAAGAC CAT CGA CATGG 

1 MM llllllll IIIIIMIIIM Mill llllllllllllll llllllllll 

TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 


125 


Db 


206 


265 


Qy 


126 


AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 MIIMMIII lllllllllll MM 

AG CTGGTG AAG CGG AAG CG CATCGAGG C CAT C CG CGG C CAGAT CCTGT C CAAG CTG CGGC 


185 


Db 


266 


325 


Qy 


186 


TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill II M 1 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 


245 


Db 


326 


385 


Qy 


246 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 

1 II llllllll IMIIMMIMM llllllll MM MIIMIIMMIMI 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 


305 


Db 


386 


445 


Qy 


306 


CAGAGGCGGACTACTACGCC^AGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 

1 Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 ! 1 M 1 1 1 1 1 1 1 1 1 MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 


365 


Db 


446 


505 


Qy 


366 


AAATCTATGATAAATTC^AGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 

1 II II M 1 M II MMM 1 1 MUM llllllll 1 MMMM M 1 

AAATCTATGACAAGTTCAAGCAGAGTACACAC^ 


425 


Db 


506 


565 


Qy 


426 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 

MMM MMMM 1 1 Mill II MMMM llllllllllllll IMMM 

ALjL 1 LLUALjAALLLtLj X ALL 1 LjAALLLLj X O 1 1 LrL 1 L X LLLLjUUL-AbAbL X LjLLj X L X LL X bA 


485 


uu 


jdd 


r o rr 

62 5 


Qy 


486 


GGCTCAAGTTAAAAGTGGAGCAGC^CGTGGAGCTATACCAGAAATACAGCAATGATTCCT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMM 

GGCTCAAGTTAAAAGTGGAGCAGC^CGTGGAGCTGTACC^GAAATAC^Ga^C^TTCCT 


545 


Db 


626 


685 


Qy 


546 


GGCGCTACCTCAG CAAC CGG CTG C - TGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTT 

MM Mil MM Mill Mill M MMM 1 II Ml 

GGCGATACCTCAGCAACCGGCTGCTTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTT 


604 


Db 


686 


745 


Qy 


605 


GATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGA - - GAGGCTATAGAGGGTTTTC 


662 


Db 


746 


1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 M 1 II 1 1 1 Ml III II 1 1 II Mill MM 

GATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGACGGGGAACATTGAGGGCTTTC 


805 


Qy 


663 


GCCTCAGTGCCCACTGTTCCTGTGACAGCAAAGATAAC 700 

MM II MMMM MMM Mill MM 1 

GCCTTAGCGCCCACTGCTCCTGTGACAGCAGGGATACC 843 




Db 


806 





RESULT 14 

AL530080/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



AL530080 841 bp mRNA linear EST 23-MAY-2003 

AL530080 Homo sapiens NEUROBLASTOMA COT 50 -NORMALIZED Homo sapiens 
cDNA clone CS0DD009YM06 3 -PRIME, mRNA sequence . 
AL530080 

AL53 008 0.2 GI : 31067915 
EST . 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 841) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 

On Feb 13, 2001 this sequence version replaced gi : 12793573. 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 

was normalized. Library was constructed by Life Technologies, a 

division of Invitrogen. This sequence belongs to sequence cluster 

9160. r For more information about this cluster, see 

http : //www . genoscope . ens . f r/ 

cgi-bin/cluster . cgi?seq=CS0DD009BG03NPl&cluster=9160 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0DD009BG03NP1 . 

Loca t ion/ Qual i f i ers 

1. .841 

/ organ ism= " Homo sapiens " 
/mol_type= "mRNA" 
/db_xref="taxon: 9606" 
/clone=" CS0DD00 9YM06" 

/tissue_type= "NEUROBLASTOMA COT 50 -NORMALIZED" 
/clone_lib="Homo sapiens NEUROBLASTOMA COT 5 0 -NORMALIZED" 
/note= M lst strand cDNA was primed with a NotI -oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
150 a 240 c 283 g 153 t 15 others 



Query Match 38.6%; Score 52 5; DB 9; Length 841; 

Best Local Similarity 80.3%; Pred. No. 1.8e-104; 

Matches 689; Conservative 8; Mismatches 104; Indels 57; Gaps 5; 
Qy 43 8 CGGTGCCGGAACCTGTATTGCTCT-CTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTA 4 96 

1 1 1 1 M hill I 111 = 111 I Mil 11.11 Mill MINIMUM 

Db 841 CGGTACCTGMACCCTTGTTGYTCTCCCCGGGCAGAGCTGCTTCTGCTAAGGCTCAAGTTA 782 

Qy 4 97 AAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTC 556 

I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I Ihlllllll llllll 
Db 781 AAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGC 722 



Qy 557 AGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGA 616 



Db 


721 


Qy 


617 


Db 


661 


Qy 


677 


Db 


603 


Qy 


737 


Db 


543 


Qy 


797 


Db 


483 


Qy 


857 


Db 


435 


Qy 


917 


Db 


414 


Qy 


977 


Db 


354 


Qy 


1037 


Db 


294 


Qy 


1097 


Db 


234 


Qy 


1157 


Db 


177 


Qy 


1217 


Db 


117 


Qy 


1277 


Db 


57 



llllllhll MINI Mill II II II II I II 1 1 1 1 MINIM 

AGCAACCSGCSGCTGGCACCCAGCSACTCGCCAGAGTGGTTTTCTTTTGTTGTCACCGKA 662 

GTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCAC 676 

1 1 1 1 1 1 1 II I II 1 1 1 III III III II II III MINIM II III II 

GTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTAAGG - - TTT CG C CTTAG CG CCNAC 604 

TGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGC 736 

I MM MUM M, Mill II III I II MINIM I I I Ml 

GGCTCCTGTAACAGC^GGGATAACACACTGCAAGTGAACATCAACGGGTTAACTACCGGC 544 

CGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGGC 7 96 

Mill I II 1 1 1 1 1 MM I II 1 1 M 1 1 M 1 1 1 1 i 1 1 . i I Mill 1 1 1 1 1 1 1 1 1 

CGCCGAGGTGACCTGGGCACCATTCTTGGCATGAACCGGCCTTTCCTGCTTCTCATGGCC 4 84 

ACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACC 856 

1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill I II 1 1 1 IM III I M II I 

ACCCCGCTGGAGAGGGCCCAGCTTCTGCAAAGCTCCCGGCACCGCCGA 436 



MINIM llllllll Ml 

- GCCCTGGACACCAACTATTGC 415 



TTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCGGGAAGGAC 976 

I II 1 1 II II I II 1 1 Ml II II 1 1 II III II 1 1 1 1 1 1 1 Mill MINN MINI 

TTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTAACTTCGGCAAGGAC 3 5 5 
CTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCC 103 6 

II llllll MINN II II I III III Ml M I II I II I M llllllll MINI 

CTCGGGTGGGAGTGGATCCACGAGCCCAAGGGCTWCCATGCCAACTTCTGCCTCGGGCCC 295 
TGTCCCTACATCTGGAGGCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAG 1096 

II MM II llllllll INN III I II 1 1 Ml II 1 1 M 1 1 llllll 

TGCCGCTACATTTGGAGCCTGGACACGCAGGACAGCAAGGTCCTGGCCCTGTACAACCAG 235 
CACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCGGCAGGCGCTGGAGCCACTGCCC 1156 

II MINN II I r 1 1 1 1 1 1 1 1 1 1 1 1 1 I M I M I M I M 1 1 II I II I II MINI 

CATAACCCGG- -GCCTCGGCGGCGCCGTGC-GCGTGCCGCAGGCGCTGGAGCCGCTGCCC 178 
ATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGT 1216 

I l MM MM IIMIIM : illlll II III MM I MINN 

TTCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGGGCGGCTGTCCAACATGGTCGTGCGC 118 



Ml MM II UN N 1 1 1 1 1 1 1 1 1 1 1 1 MINIM INN 

TCCTGCAAGGGCCGCTGGGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCi 
CCCCACCCCCGCCCGCCT 12 94 

Ml INI MM II 



RESULT 15 
BI084718/C 

LOCUS BI084718 956 bp mRNA linear EST 20-JUN-2001 

DEFINITION 602869722T1 NIH__MGC_102 Homo sapiens cDNA clone IMAGE : 501442 1 3', 
mRNA sequence. 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



BI084718 

BI084718 .1 GI : 14503048 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 956) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email : cgapbs-r@mail .nih .gov 
Tissue Procurement: ATCC 

cDNA Library Preparation: CLONETECH Laboratories, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
http: //image. llnl .gov 
Plate: LLCM1821 row: 1 column: 06 
High quality sequence start: 3 
High quality sequence stop: 793. 
Locat ion/Qual i f iers 
1. .956 

/organi sm= " Homo sap i ens " 
/mol_type=="mRNA" 
/db_xref ="taxon: 9606" 
/clone=" IMAGE: 5014421" 

/tissue_type="epidermoid carcinoma, cell line" 
/lab_host="DH10B (phage-resistant ) " 
/ c 1 one_l ib="NI H_MGC_1 0 2 " 

/note= M 0rgan: salivary gland; Vector: pOTB7; Site_l: Xhol ; 
Site_2: EcoRI ; cDNA made by oligo-dT priming. 
Directionally cloned into EcoRI/XhoI sites using the 
following 5' adaptor: GGCACGAG (G) . Library constructed 
by Ling Hong in the laboratory of Gerald M. Rubin 
(University of California, Berkeley) using ZAP-cDNA 
synthesis kit (Stratagene) and Superscript II RT (Life 
Technologies) . Note: this is a NIH_MGC Library." 
172 a 273 c 312 g 199 t 



Query Match 38.5%; Score 524.6; DB 12; Length 956; 

Best Local Similarity 77.0%; Pred. No. 2.3e-104; 

Matches 757; Conservative 0; Mismatches 159; Indels 67; Gaps 



Qy 

Db 

Qy 
Db 

Qy 



32 9 GAGGT(^CCCGCGTGCTAATGGTGGAAAGCGGCAAC(^AATCTATGATAAATTCAAGGGC 38 8 

Mill I III 1 1 1 1 1 1 1 1 1 1 1 I I II II I 1 1 1 1 I 

955 GAGGTTAACCGGTGCTAAATGGTGGAAACCCACCACGAATCTATGACCAGGTTCAGCAGA 8 96 
38 9 ACCCCCCACAGCTTATATATGCTGTTCAACACGTC- -GGAGCTCCGGGAAGCGGTGCCGG 44 6 

II III Mill I MM II I MM III MM II 

8 95 GTACACACAGCCATAAATATGGTCTTCACACATTCAGGAGCTTCCGAGAACCGGTACCCT 83 6 
44 7 AACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTG CTGAGGCTCAAGTTAAAAGT 501 

I I II Ml II I II II I I 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 



Db 


835 


Qy 


502 


Db 


775 


Qy 


561 


Db 


715 


Qy 


619 


Db 


655 


Qy 


678 


Db 


595 


Qy 


738 


Db 


535 


Qy 


798 


Db 


475 


Qy 


858 


Db 


419 


Qy 


918 


Db 


406 


Qy 


978 


Db 


346 


Qy 


1038 


Db 


286 


Qy 


1098 


Db 


226 


Qy 


1158 


UD 


lbb 


Qy 


1218 


Db 


106 


Qy 


1278 


Db 


51 



835 GAACCGTGTTGGCTCTTCCCGGGGAAGAGCTGCCGTCTGCCTGAGGCTCAAGTTAAAAGT 77 6 

IGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCAGCA 56 0 
I I Mllllllllllll II I I I llllllllll llllllllll 



ACCGGCTGCTGGCCCCCAGTGA - - CTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGT 618 

llllllll 1 1 1 1 lllll II II Mill I II 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 

ACCGGCTGGTGGCACCCAGCGAACTCGCCCAGAGTGGTTATCTTTTGATGTCACCGGAGT 656 
TGTGCGGCAGTGGCTGACCCGCAG - AGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACT 677 

1 1 1 1 1 1 1 1 1 1 1 1 1 III III I II II II Mill llllllll II lllllll 

TGTGCGGCAGTGGTTGAGCCGTGGAAGGGGAAATTGAGGGCTTTCGCCTTAGCGCCCACT 5 96 
GTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCC 73 7 

I IIIIIIMIIIII MIIMIMM II lllll M llllllllll I I MM 

GCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCC 53 6 
GCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCA 7 97 

MM MMMIMMMMIMM MMIMMIMM 1 1 1 r 1 1 1 1 llllllllll 

GCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCA 4 76 
CCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCA 8 57 

1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 lllll MIMMIMIMMMMMIMM 

CCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCCCTGGAC 4 2 0 

ACAGCTACCCATACGACGTGCCAGACTACGCATCTCTGGCCCTGGATACCAACTACTGCT 917 

llllllll MM 

ACCAACTATTGCT 4 07 

T CAG CT C CACGG AGAAGAACTG CTG CGTG CGG CAG CT CTACATTGACTTCCGGAAGGACC 977 

1 1 1 1 1 1 : 1 1 1 II! M I M I M I M M 1 1 1 1 III I h I MIIMIMM lllllll 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 34 7 
TGGGCTGGAAGTGGATTC^TGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 1037 

I MUM Ml II II 1 1 1 1 1 1 M 1 1 1 1 1 1 1 MM llllllll lllllll 

TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 28 7 
GTCCCTACATCTGGAGCCTAGAC^CTC^GTACAGCAAGGTCCTGGCTCTGTACAACCAGC 1097 

I llllllll llllllll lllll 1 1 1 III I M I M II MM IIIIIIMIIIII 

GCCCCTACATTTGGAGCCTGGAC^CG(^GTACAGCAAGGTCCTGGCCCTGTAC^C(^GC 227 
ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 1157 

I 1 1 1 II I II II I Mill lllll M IMIM ' MM Mill lllllll 

ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 167 
TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 1217 

I M 1 1 M I III I M 1 1 M I II 1 1 1 1 1 1 1 III I M I II II 1 1 II 1 1 II II M II 1 1 I 

T CGTGTA CTA CGTGGG C CG CAAG CC CAAGGTGGAG CAG CTGT C CAACATGATCG TG CG CT 107 
CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 12 77 

1 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 1 MIIMIMM II I II lllllllllllllll 

CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCC CCGCCCCGCCCCGGCAGGCCCGGC 52 



llllllll IMIM I II 



Search completed: October 28, 2003, 00:08:32 
Job time : 3384.39 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: October 27, 2003, 19:11:58 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched: 



US-10-017-372E-39 
2169 

1 MAPSGLRLLPLLLPLLWLLV. 



; Search time 33.9534 Seconds 

(without alignments) 

1902.657 Million cell updates/sec 



. GRKPKVEQLSNMIVRSCKCS 4 07 



BLOSUM62 
Gapop 10.0 



Gapext 0 . 5 



1107863 seqs, 158726573 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



1107863 



Database : 



A_Geneseq_19Jun03 : * 



1 

2 
3 
4 
5 
6 
7 
8 
9 

10 
11 

12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 



/ S I DS 1 / gcgda t a / genes eq/genes eqp - 
/SIDSl/gcgdata/geneseq/geneseqp 
/SIDSl/gcgdata/geneseq/geneseqp- 
/SIDSl/gcgdata/geneseq/geneseqp- 
/S IDS 1 /gcgda ta/geneseq/genes eqp - 
/S IDS 1 /gcgda ta/geneseq/genes eqp - 
/SIDSl/gcgdata/geneseq/geneseqp- 
/SIDSl/gcgdata/geneseq/geneseqp- 
/S IDS 1 /gcgda ta /geneseq/geneseqp - 
/SIDSl/gcgdata/geneseq/geneseqp 
/ SIDSl/gcgdata/geneseq/geneseqp 
/SIDSl/gcgdata/geneseq/geneseqp 
/SIDSl/gcgdata/geneseq/geneseqp 
/SIDSl/gcgdata/geneseq/geneseqp 
/SIDSl/gcgdata/geneseq/geneseqp 
/SIDSl/gcgdata/geneseq/geneseqp 
/SIDSl/gcgdata/geneseq/geneseqp 
/SIDSl/gcgdata/geneseq/geneseqp 
/SIDSl/gcgdata/geneseq/geneseqp 
/SIDSl/gcgdata/geneseq/geneseqp 
/SIDSl/gcgdata/geneseq/geneseqp 
/SIDSl/gcgdata/geneseq/geneseqp 
/SIDSl/gcgdata/geneseq/geneseqp 
/SIDSl/gcgdata/geneseq/geneseqp 



embl/AA1980 . 
embl/AA1981 . 
embl/AA1982. 
embl/AA1983 . 
embl/AA1984 . 
embl/AA1985 . 
embl/AA1986. 
embl/AA1987 . 
embl/AA1988 . 
-embl/AA1989 
-embl/AA1990 
-embl/AA1991 
-embl/AA1992 
-embl/AA1993 
-embl/AA1994 
-embl/AA1995 
-embl/AA1996 
-embl/AA1997 
-embl/AA1998 
-embl/AA1999 
-embl/AA2000 
-embl/AA2001 
-embl/AA2002 
-embl/AA2003 



DAT: * 
DAT: * 
DAT:* 
DAT: * 
DAT: * 
DAT: * 
DAT:* 
DAT: * 
DAT: * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 



and is derived by analysis of the total score distribution. 
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33 
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60 
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12 


AAR12541 


Latency associated 


34 


1282 .5 


59 


. 1 


458 


23 


ABG315 07 


LAP-mlFNB construe 


35 


1282 .5 


59 


. 1 


463 


23 


ABG31510 


LAP-huIFNB constru 


36 


1189 


54 


. 8 


290 


22 


ABG06792 


Novel human diagno 


37 


1155 


53 


. 3 


450 


23 


ABG315 08 


mIFNB-LAP construe 


38 


1138 


52 


. 5 


448 


23 


ABG31509 


huIFNB-LAP constru 


39 


944 


43 


. 5 


227 


22 


ABG20234 


Novel human diagno 


40 
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41 


. 5 


236 


22 


ABG20233 


Novel human diagno 


41 


887.5 


40 


. 9 
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21 


AAB08338 


Amino acid sequenc 


42 


887 . 5 


40 


. 9 
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23 


AAU77105 


Frog transforming 


43 


849 . 5 


39 
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19 


AAW78786 


Pig transforming g 
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AAR73598 
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AAR46228 


Pig TGF-beta-3. S 



ALIGNMENTS 



RESULT 1 
AAE13596 

ID AAE13596 standard; Protein; 390 AA . 
XX 

AC AAE13596; 
XX 

DT 26-FEB-2002 (first entry) 
XX 

DE Porcine transforming growth factor beta 1 (TGF-betal) mutant. 
XX 

KW Porcine; transforming growth factor beta 1; TGF-betal; gene therapy; 

KW IBD; inflammatory bowel disease; autoimmune disease; immunosuppressive; 

KW multiple sclerosis; rheumatoid arthritis; systemic lupus erythematosus; 

KW diabetes mellitus; sarcoidosis; psoriasis; dermatological ; mutant; 

KW mute in. 
XX 

OS Sus scrofa. 
XX 

FH Key Location/Qualifiers 

FT Misc-dif f erence 223 

FT /note= "Wild type Cys substituted with Ser" 

FT Misc-dif f erence 225 

FT /note= "Wild type Cys substituted with Ser" 

XX 

PN WO200181404-A2 . 
XX 

PD 01-NOV-2001. 
XX 

PF 20-APR-2001; 2001WO-US12980 . 
XX 

PR 20-APR-2000; 2000US - 1 990 14P . 
XX 

PA (USSH ) US DEPT HEALTH & HUMAN SERVICES. 
XX 

PI Strober W, Nakamura K, Kitani A, Fuss IJ; 
XX 

DR WPI; 2002-026155/03. 

DR N-PSDB; AAD22 696 . 
XX 

PT Composition for treating autoimmune diseases e.g. inflammatory bowel 

PT disease in humans, comprises vector containing transforming growth 

PT factor-beta under the control of inducible promoter 
XX 

PS Example 1; Fig 1; 78pp; English. 
XX 

CC The invention relates to a composition containing a vector comprising a 

CC gene encoding a regulatory transcription factor under the control of a 

CC promoter encoding a transforming growth factor-beta (TGF-beta) . The 

CC vector is useful for expressing TGF-beta, such as TGF-betal, TGF-beta2 

CC or TGF-beta3, its variants or homologues, by transfecting a cell which 

CC is part of a host suspected of having an autoimmune disease, especially 

CC inflammatory bowel disease (IBD), under conditions such that the 

CC polypeptide encoded by the nucleic acid sequence in the vector is 

CC expressed. The vector is delivered using a delivery system. The delivery 

CC of the vector results in substantial elimination of symptoms of the 

CC autoimmune disease and increased production of IL-10 by the host. The 

CC composition is useful for treating various diseases with an autoimmune 



CC component such as multiple sclerosis, rheumatoid arthritis, systemic 

CC lupus erythematosus, insulin-dependent diabetes mellitus, sarcoidosis 

CC and psoriasis, and also for assaying the expression of a gene in a cell. 

CC The vector is further useful for screening of the effect of test 

CC compounds on cytokine (e.g. TGF-beta) expression of transfected cells. 

CC The present sequence is porcine TGF-betal mutant. 

XX 

SQ Sequence 3 90 AA; 

Query Match 94.0%; Score 2038.5; DB 23; Length 390; 

Best Local Similarity 95.3%; Pred. No. 1.7e-170; 

Matches 388; Conservative 0; Mismatches 2; Indels 17; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRI EAI RGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

I I I I I II I I I I II I I I I I I I I I I I I I II I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 240 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 



Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 3 00 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRR ALDTN 2 83 

Qy 301 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II 
Db 284 YCFSSTEKNCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLiALY 343 

Qy 3 61 NQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 4 07 

IIIIIMI Mill Mill II IlilllllllllllllMIIII 

Db 344 NQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 2 
AAM39186 

ID AAM39186 standard; Protein; 390 AA. 
XX 

AC AAM39186; 
XX 

DT 22-OCT-2001 (first entry) 
XX 

DE Human polypeptide SEQ ID NO 2331. 
XX 

KW Human; nootropic; immunosuppressant; cytostatic; gene therapy; cancer; 

KW peripheral nervous system; neuropathy; central nervous system; CNS; 

KW Alzheimer's; Parkinson's disease; Huntington's disease; haemostatic; 

KW amyotrophic lateral sclerosis; Shy-Drager Syndrome; chemotactic; 

KW chemokinetic; thrombolytic; drug screening; arthritis; inflammation; 



KW leukaemia . 
XX 

OS Homo sapiens . 
XX 

PN WO200153312-A1 . 
XX 

PD 26-JUL-2001. 
XX 

PF 26-DEC-2000; 2000WO-US34263 . 
XX 

PR 21-JAN-2000; 2 0OOUS- 0488725 . 

PR 25-APR-2000; 2000US-0552317 . 

PR 09-JUL-2000; 2000US-0598042 . 

PR 19-JUL-2000; 2000US- 062 03 12 . 

PR 03-AUG-2000; 2000US-0653450 . 

PR 14-SEP-2000; 2000US-0662191 . 

PR 19-OCT-2000; 2000US-0693036. 

PR 29-NOV-2000; 2000US - 0727344 . 
XX 

PA (HYSE-) HYSEQ INC . 
XX 

PI Tang YT, Liu C, Asundi V, Chen R, Ma Y, Qian XB, Ren F , Wang D; 

PI Wang J, Wang Z, Wehrman T, Xu C ( Xue AJ, Yang Y, Zhang J; 

PI Zhao QA, Zhou P, Goodrich R, Drmanac RT; 
XX 

DR WPI; 2001-442253/47. 

DR N-PSDB; AAI58342. 
XX 

PT Novel nucleic acids and polypeptides, useful for treating disorders 

PT such as central nervous system injuries - 

XX 

PS Example 4; SEQ ID NO 2331; 10078pp; English. 
XX 

CC The invention relates to human nucleic acids (AAI57798 -AAI61369 ) and 

CC the encoded polypeptides (AAM38 642-AAM4 2213 ) with nootropic, 

CC immunosuppressant and cytostatic activity. The polynucleotides are useful 

CC in gene therapy. A composition containing a polypeptide or polynucleotide 

CC of the invention may be used to treat diseases of the peripheral nervous 

CC system, such as peripheral nervous injuries, peripheral neuropathy and 

CC localised neuropathies and central nervous system diseases, such as 

CC Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic 

CC lateral sclerosis, and Shy-Drager Syndrome. Other uses include the 

CC utilisation of the activities such as: Immune system suppression, 

CC Act ivin/inhibin activity, chemotact ic/chemokinet ic activity, haemostatic 

CC and thrombolytic activity, cancer diagnosis and therapy, drug screening, 

CC assays for receptor activity, arthritis and inflammation, leukaemias and 

CC C.N.S disorders. 

CC Note: The sequence data for this patent did not form part of the printed 

CC specification. 

XX 

SQ Sequence 3 90 AA; 

Query Match 89.7%; Score 1946.5; DB 22; Length 390; 

Best Local Similarity 90.4%; Pred. No. 2.1e-162; 

Matches 368; Conservative 10; Mismatches 12; Indels 17; Gaps 1; 



Qy 



1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 



Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


284 


Qy 


361 


Db 


344 



I 1 1 1 1 1 1 1 1 1 ! 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 ; 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 

MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRI EAI RGQILSKLRLA 60 



120 



IIMIMM Mill IMIM I Mill MM I I I I I I I I ' I 1 M I , I I INI: I 



YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 ^ 1 1 1 

YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 
YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 24 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h I 1 1 1 1 1 1 1 II I II I M 1 1 1 1 = 1111 

YLSNRLIAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 24 0 

SGRRGDLATI HGMNRPFLLLMATPLERAQHLHSSRHRRALDTNS YPYDVPDYASLALDTN 3 00 

: I I I I I I I I I I I I I II I I I II I 1 I I I I I I I I II I I I I Mill 
TGRRGDLATI HGMNRPFLLLMATPLERAQHLQSSRHRR ALDTN 28 3 

YCFSSTEKNCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

II 1 1 M 1 1 1 III II 1 1 1 1 1 I II 1 1 MM M II Ml Ml M I M 1 1 1 1 M M 1 1 III 

YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 34 3 
NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 4 07 

i III II I II I II M II I II III II II II II M III 1 1 III 1 1 1 1 1 1 

NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 3 90 



RESULT 3 




AAP61468 




ID 


AAP61468 standard; Protein; 390 AA. 


XX 






AC 


AAP61468; 




XX 






DT 


31-OCT-2002 


(updated) 


DT 


28-OCT-1991 


(first entry) 


XX 






DE 


PreTGF-beta gene product. 


XX 






KW 


Transforming 


growth factor beta; cancer 


XX 






OS 


Unidentified 




XX 






FH 


Key 


Locat ion/ Qua 1 if iers 


FT 


Protein 


279. .390 


XX 






PN 


EP200341-A. 




XX 






PD 


10-DEC-1986. 




XX 






PF 


21-MAR-1986; 


86EP-0302112. 


XX 






PR 


22-MAR-1985; 


85US-0715142 . 


PR 


13-MAR-1987; 


87US-0025423 . 


XX 






PA 


(GETH ) GENENTECH INC. 


XX 







PI Derynck RMA; 
XX 

DR WPI; 1986-326875/50. 

DR N-PSDB; AAN60972. 
XX 

PT TGF-beta prodn. from transformed hosts - useful esp. for treating 

PT wounds <J6 2/9/86) . 

XX 

PS Disclosure; Fig lb; 26pp; English. 
XX 

CC The gene product is known to stimulate cell proliferation and 

CC inhibit anchorage -dependent growth of a variety of human cancer cell 

CC lines, it is esp . useful in treatment of burns and the promotion of 

CC surface and internal wound healing. TGF-beta may be expressed from a 

CC transformed CHO cell line. 

CC (Updated on 31-OCT-2002 to add missing OS field.) 
XX 

SQ Sequence 390 AA; 

Query Match 89.4%; Score 1939.5; DB 7; Length 390; 

Best Local Similarity 90.2%; Pred. No. 8.5e-162; 

Matches 367; Conservative 10; Mismatches 13; Indels 17; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQILSKLRLA 60 

I Mllllllllllllllllllll 1 1 II I II ! II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTI DMELVKRKRI EAI RGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

1 1 1 MM 1 1 1 1 M 1 1 II I ! M I! 1 1 II I II Mllllllllllllllllllll: hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : 11 = 11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I 
Db 121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 240 

II 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 !! I M Mill I II! hi I hill 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240 



Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 3 00 

M I MM 1 1 1 1 1 1 1 1 1 M I hi h I MM I llllll MIM 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTN 283 

Qy 301 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

I M MM 1 1 1 1 1 1 1 1 IM 1 1 1 1 M 1 1 II I III II 1 1 1 1 1 1 1 h I M II 1 1 M M 1 1 1 

Db 284 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 

Qy 361 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 407 

I I MIM I 1 1 1 1 1 1 1 1 1 1 1 1 II I IM II 1 1 II h h 1 1 1 II I II 

Db 344 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 4 
AAR04034 

ID AAR04034 standard; protein; 390 AA. 
XX 

AC AAR04034; 



XX 

DT 25-MAR-2003 (updated) 

DT 31-OCT-2002 (updated) 

DT 31-MAY-1989 (first entry) 
XX 

DE Sequence of pre-TGF-beta 1. 
XX 

KW Transforming growth factor beta-3 (TGF beta 3) ; tumour cells; growth 

KW inhibition. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 348.. 500 

XX 

PN WO8912101-A. 
XX 

PD 14-DEC-1989. 
XX 

PF 08-JUN-1988; 88WO-US0194 5 . 
XX 

PR 08-JUN-1988; 88WO-US0194 5 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Dernyck RMA, Goeddel DV; 
XX 

DR WPI; 1990-007474/01. 

DR N-PSDB; AAQ02815. 
XX 

PT Nucleotide sequence encoding transforming growth factor beta-3 used as a 

PT probe, or to produce TGF beta 3, for inhibiting growth of certain normal 

PT and neoplastic cells, eg A549. 
XX 

PS Disclosure; Fig. 2; 61pp; English. 
XX 

CC Sequence is an exon of transforming growth factor-beta 1 (pre-TGF-beta 

CC 1) polypeptide and corresponds to AA's 288-338 of mature TGF-beta 1. The 

CC nucleic acid encoding second subtype of TGF-beta (TGF-beta 3) is useful 

CC as a probe or to produce TGF-beta 3 for inhibition of normal and 

CC neoplastic cell growth. 

CC (Updated on 31-OCT-2002 to add missing OS field.) 

CC (Updated on 25-MAR-2003 to correct PR field.) 

CC (Updated on 25-MAR-2003 to correct PI field.) 
XX 

SQ Sequence 3 90 AA; 



Query Match 89.4%; Score 1939.5; DB 11; Length 390; 

Best Local Similarity 90.2%; Pred. No. 8.5e-162; 

Matches 367; Conservative 10; Mismatches 13; Indels 17; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I I I I 1 1 i I I I I ' I I ' I I J 1 II I I I I I I I I I I I I I I II I I I M I II I , I I I I I I I li 
Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

1 1 1 1 1 h 1 1 1 1 1 1 1 1 II I II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II MM I IN II Tl 1 1 h hi 



Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVXLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : Ml llilllllliill IMIIIMIMM IIIMI IMIIIhl! 

Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDS PEWLSFDVTGWRQWLTRREAI EGFRLSAHCSCDSKDNTLHVE I NGFN 240 

I I I I I I I I I I I I I I I I I I I I I I I M I I h I II I I I I I II I I I I I : II I I Mill 
Db 181 YLSNRLLAPSDS PEWLSFDVTGWRQWLSRGGE I EGFRLSAHCSCDSRDNTLQVDINGFT 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 3 00 

M I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I Mill 
Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTN 2 83 

Qy 301 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

II I Mill Mill I III Mill Mill II I III III MM II Ml III I III I 

Db 284 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 

Qy 361 NQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 4 07 

1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 344 NQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNM I VRSCKCS 3 90 



RESULT 5 




AAR05258 




ID 


AAR05258 standard; protein; 390 AA . 


XX 






AC 


AAR05258; 




XX 






DT 


25-MAR-2003 


(updated) 


DT 


05-AUG-1990 


(first entry) 


XX 






DE 


Human pre- transforming growth factor-beta- 1 (pre-TGF-beta- 1) 


XX 






KW 


Transforming growth factor-beta - 1 (TGF-beta-1) ; 


KW 


neoplastic cell line inhibition; 


KW 


EGF-potentiated anchorage- independent growth. 


XX 






OS 


Homo sapiens. 




XX 






FH 


Key 


Loca t ion/ Qual i f iers 


FT 


Peptide 


1 . .278 


FT 


Protein 


279. .2011 


FT 


Domain 


8 . .23 


FT 




/note= "hydrophobic domain" 


FT 


Modif ied-site 


82 . . 84 


FT 




/note="potent ial N-glycosylat ion site " 


FT 


Modif ied-site 


136. .138 


FT 




/note="as above" 


FT 


Modif ied-site 


176. . 178 


FT 




/note="as above" 


FT 


Cleavage-site 


277. .278 


FT 




/note="proteolytic cleavage site" 


XX 






PN 


US4886747-A. 




XX 






PD 


12-DEC-1989. 





XX 

PF 13-MAR-1987; 87US- 0025423 . 
XX 

PR 13-MAR-1987; 87US-0025423 . 

PR 22-MAR-1985; 8 5US - 07 1 5 142 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1990-051338/07. 

DR N-PSDB; AAQ93301. 
XX 

PT Nucleic acid encoding transforming growth factor-beta - 

PT cloned into expression vectors for expression in eukaryotic host 

PT cells for therapeutic use 

XX 

PS Disclosure; Fig lb; 28pp; English. 
XX 

CC Pre-TGF-beta-1 is a polypeptide of 390 amino acids. Post -transitional 

CC cleavage of the precursor gives rise to the mature TGF-beta monomer. 

CC The sequence for human TGF-beta was determined by direct amino acid 

CC sequence analysis and by deduction from the TGF-beta cDNA. It is 

CC capable of inducing EGF-potent iated anchorage -independent growth of 

CC target cell lines, and/or growth inhibition of neoplastic cell lines. It 

CC can be used for treating wounds, eg burns or epidermal ulcers. 

CC (Updated on 25-MAR-2003 to correct PF field.) 

XX 

SQ Sequence 39 0 AA; 

Query Match 8 9.4%; Score 1939.5; DB 11; Length 390; 

Best Local Similarity 90.2%; Pred . No. 8.5e-162; 

Matches 367; Conservative 10; Mismatches 13; Indels 17; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I I I I I : : l l • I l 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I I I I I I : I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I = hi 
Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

Mill : I I ^ : I 1 1 ! I I I I I I I I I I I i I I I I I I I I i I I I I I I I I E 

Db 121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 240 

I I : I; I I I I I I I I I I I I I I I M I I I h lllllllllllllhll Ml 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 3 00 

■mum i r i' ii , in Mini mil 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTN 283 

Qy 3 01 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 ! 1 1 

Db 284 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 



Qy 361 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 407 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 344 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 6 
AAR13813 
ID 
XX 



AC 
XX 
DT 
XX 
DE 
XX 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
FT 
FT 
FT 
FT 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
XX 
PA 
XX 
DR 
DR 
XX 
PT 
PT 
PT 
XX 
PS 
XX 

cc 
cc 
cc 

XX 
SQ 



AAR13813 standard; Protein; 390 AA. 
AAR13813; 

20-NOV-1991 (first entry) 
Human pro-TGF-beta 1. 
Osteogenetic; tumoricidal . 
Homo sapiens. 



Key 

Peptide 
Peptide 
Peptide 

JP03180192-A. 

06- AUG-1991 . 

07- DEC-1989; 
07-DEC-1989; 



Location/Qualifiers 
1. .29 

/note= "signal peptide" 
30. .390 

/note= "pro-TGF-beta 1" 
279. .390 

/note= "TGF-beta 1" 



89JP-0318243 . 
89JP-0318243. 



(KIRI ) KIRIN BREWERY KK. 

WPI ; 1991-271579/37 . 
N-PSDB; AAQ13392. 



Human pro-TGF-beta 1 prodn. , for osteo-genet ic activity - by 
preparing DNA chain contg. base sequence coding for human 
pre: pro-TGF-beta 1, forming expression vector etc. 

Claim 1; Fig 1; 16pp; Japanese. 

The amino acid sequence codes for human prepro-TGF-beta 1 which 
can be produced by recombinant methods, it has osteogenetic and 
tumoricidal activity. 

Sequence 3 90 AA; 

Query Match 89.4%; Score 1939.5; DB 12; Length 390; 

Best Local Similarity 90.2%; Pred. No. 8.5e-162; 

Matches 367; Conservative 10; Mismatches 13; Indels 17; Gaps 



1; 



Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I h hi 
Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

Mill = Ihll ,1 1 1 1 II 1 1 1 ! 1 1 1 1 , 1 1 1 1 1 1 1 1 1 1 1 IN i 1 1 1 1 1 il I : I h I I I 

Db 121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I II h I I I I HIM 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEI EGFRLSAHCSCDSRDNTLQVDINGFT 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 3 00 

MIMIIMIIIIMIIIIIIIIIIIMIII IMIM Mill 

Db 241 TGRRGDLATI HGMNRPFLLLMATPLERAQHLQSSRHRR ALDTN 2 83 

Qy 301 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 1 1 

Db 284 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 

Qy 3 61 NQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 4 07 

II I I I I I I II I I I I I I II II I I I II I I I I I I I I I I I I I M I I I I I I I 

Db 344 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 7 




AAR73596 




ID 


AAR73596 standard; Protein; 


XX 






AC 


AAR73596; 




XX 






DT 


25-MAR-2003 


(updated) 


DT 


20-DEC-1995 


(first entry) 


XX 






DE 


Human TGF-beta 1 protein. 


XX 






KW 


Transforming growth factor-b 


KW 


TGF-beta 3; osteogenic cell 


KW 


bone -inducing 


cofactor . 


XX 






OS 


Homo sapiens . 




XX 






PN 


US5409896-A. 




XX 






PD 


25-APR-1995 . 




XX 






PF 


12-NOV-1993; 


93US-0132405 . 


XX 






PR 


01-SEP-1989; 


89US-0401906 . 


PR 


12-NOV-1991; 


91US-0790856 . 


PR 


18-MAY-1993; 


93US-0063841 . 


PR 


12-NOV-1993; 


93US-0132405. 


XX 







390 AA. 



PA (GETH ) GENENTECH INC. 
XX 

PI Ammann AJ, Rudman CG; 
XX 

DR WPI; 1995-169610/22. 
XX 



PT Compsn. for treating skeletal tissue deficiency - comprising 

PT transforming growth factor-beta and an osteogenic cell source in a 

PT carrier 

XX 

PS Claim 3; Column 15-18; 19pp; English. 
XX 

CC This sequence represents human transforming growth factor-beta 1 

CC (TGF-beta 1) . This sequence and the sequence for human TGF-beta 3 (see 

CC AAR73598) are claimed within the scope of the invention. The invention 

CC is a composition consisting of a TGF-beta protein and an osteogenic cell 

CC source (OCS) formulated in an acceptable carrier other than a bone 

CC morphogenic cofactor. This composition can be used for the restoration 

CC of bone deficiency. This provides for the generation of mature bone 

CC only where it is required, without the inclusion of a specific 

CC bone- inducing cofactor. This method can be used with any of the 5 human 

CC TGF-beta • s or with TGF-beta from other species. 

CC (Updated on 25-MAR-2003 to correct PF field.) 

XX 

SQ Sequence 3 90 AA; 

Query Match 89.4%; Score 1939.5; DB 16; Length 390; 

Best Local Similarity 90.2%; Pred. No. 8.5e-162; 

Matches 367; Conservative 10; Mismatches 13; Indels 17; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I 1 1 1 1 1 1 1 ' II 1 1 1 M 1 1 1 1 1 1 1 Mill I Mill III I II MM III Ml I MM III 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

MMIhll Mill Mill III II 1 1 MM II I i 1 1 1 1 1 1 1 IS I M 1 1 1 1 1 1 h hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

Mill : I MM I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I h I I I 
Db 121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 240 

I II 1 1 1 1 1 1 II 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 IIMI MlllilMIII Mill 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 3 00 

= 1 1 II I II I M I II 1 1 1 1 II I II I II II 1 1 1 M 1 1 II 1 1 1 1 1 

Db 241 TGRRGDLAT I HGMNRPFLLLMATPLERAQHLQSSRHRR ALDTN 283 

Qy 301 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

II III II II Mill II MM III MM II Mill IMIMI MM Ml Ml I II Ml II 

Db 284 YCFSSTEKNCCVRQLYIDFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 

Qy 361 NQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 4 07 



344 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 3 90 



RESULT 8 
AAR90827 

ID AAR90827 standard; Protein; 390 AA. 
XX 

AC AAR90827; 
XX 

DT 25-MAR-2003 (updated) 

DT 25-JAN-1980 (first entry) 

XX 

DE Pre-transf orming growth factor beta 1. 
XX 

KW transforming growth factor beta 1; wound healing; 

KW recombinant production. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT Domain 8 . . 23 

FT /note= "hydrophobic domain" 

FT Modif ied-site 82.. 84 

FT /note= "potential N-glycosylation site" 

FT Modif ied-site 136.. 138 

FT /note= "potential N-glycosylat ion site" 

FT Modif ied-site 176.. 178 

FT /note= "potential N-glycosylation site" 

FT Cleavage-site 277.. 279 

FT /note= " trypsin- 1 ike peptidase cleavage site" 

FT Protein 279 . .390 

FT /label= mature_TGF_beta_l 
XX 

PN US5482851-A. 
XX 

PD 09-JAN-1996. 
XX 

PF 05-NOV-1993; 93US - 0 14 73 64 . 
XX 

PR 13-MAR-1987; 87US - 0025423 . 

PR 22-MAR-1985; 85US- 0715142 . 

PR 04-AUG-1989; 89US- 0389929 . 

PR 04-MAR-1992; 92US- 084 58 93 . 

PR 05-NOV-1993; 93US-0147364 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1996-076891/08. 

DR N-PSDB; AAT15720. 
XX 

PT New recombinant human transforming growth factor-beta prods. - produced 

PT using Chinese hamster ovary cells, for use in diagnostic applications 

PT or in therapy 
XX 

PS Example 3; Fig 1A-C; 2 6pp ; English. 
XX 



CC The pre -trans forming growth factor (TGF) beta 1 protein is encoded 

CC by AAT15720. The mature TGF beta 1 monomer is cleaved from the 

CC precursor at the Arg-Arg dipeptide immediately preceding the mature 

CC TGF-beta 1 NH2 -terminus . It does not contain a recognisable N-terminal 

CC signal peptide typical of most secreted proteins. The pre -TGF beta 1 

CC contains several pairs of basic residues which could undergo 

CC post-translational cleavage and give rise to separate polypeptide 

CC entities. The precursor contains 3 potential N-glycosylat ion sites, none 

CC of which are localised in the mature TGF beta 1. This is useful in 

CC purification of the mature protein. TGF beta 1 can be used in, e.g. wound 

CC healing. 

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 390 AA; 

Query Match 89.4%; Score 1939.5; DB 17; Length 390; 

Best Local Similarity 90.2%; Pred. No. 8.5e-162; 

Matches 367; Conservative 10; Mismatches 13; Indels 17; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I lllllllllllllllllllllh hi 
Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : Ihll 1 1 1 1 1 1 1 1 1 1 1 1 II III I II II MINI III II I III I Ml hill 

Db 121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHCSCDSICDNTLHVEINGFN 240 

II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 h I 1 1 II 1 1 1 1 1 1 II I h I M I hllll 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 24 0 

Qy 241 SGRRGDLAT I HGMNR PFLLLMATPLERAQHLHS SRHRRALDTNS Y P YD VPD YAS LALDTN 300 

M II II I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I M II 1 1 1 1 Mill 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTN 283 

Qy 301 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

1 1 1 1 1 ! 1 1 . 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M ; 1 1 1 1 1 1 1 1 , 1 1 1 1 1 1 

Db 284 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 

Qy 361 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 4 07 

i I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I 
Db 344 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 9 
AAU77101 

ID AAU77101 standard; Protein; 390 AA. 
XX 

AC AAU77101; 
XX 

DT 05-JUN-2002 (first entry) 
XX 

DE Human transforming growth factor beta 1 (TFG-beta-1) polypeptide. 



XX 

KW Human; transforming growth factor beta; TGF-beta; insulin production; 

KW type I diabetes mellitus; pancreatic cell outgrowth; wound healing; 

KW pancreatic duct tissue; ischaemia; stroke; nervous system aging; 

KW neurological condition; neurodegenerative disease; inflammation; 

KW vasal injury; chemical injury; traumatic injury; tumour- induced injury; 

KW amyotrophic lateral sclerosis; spinocerebellar degeneration; 

KW immunological disease; multiple sclerosis; TGF-beta- 1. 

XX 

OS Homo sapiens. 
XX 

PN WO200212336-A2 . 
XX 

PD 14-FEB-2002. 
XX 

PF 09-FEB-2001; 2 0 01WO-US04 1 92 . 
XX 

PR 09-AUG-2000; 2 000US- 0635368 . 
XX 

PA ( CUR I - ) CURIS INC. 
XX 

PI Wang M, Pang K; 
XX 

DR WPI; 2002-257468/30. 
XX 

PT Treating a subject with a disorder resulting from insufficient insulin 

PT production, and inducing outgrowth of pancreatic cells, involves using 

PT a transforming growth factor beta therapeutic 
XX 

PS Disclosure; Fig 1; 77pp ; English, 
XX 

CC The invention relates to treating a subject with a disorder resulting 

CC from insufficient insulin production, involving contacting the subject 

CC with a transforming growth factor beta (TGF-beta) therapeutic. TGF-beta 

CC polypeptides can be used for treating a subject with a disorder resulting 

CC from insufficient insulin production, e.g. type I diabetes mellitus, and 

CC for inducing outgrowth of pancreatic cells associated with pancreatic 

CC duct tissue within a subject, A composition comprising a TGF-beta protein 

CC may be useful in wound healing and treatment of neurological conditions 

CC derived from acute, subacute or chronic injury to the nervous system, 

CC including traumatic injury, chemical injury, vasal injury and deficits 

CC (such as ischaemia resulting from stroke) , together with 

CC infectious/inflammatory and tumour- induced injury, aging of the nervous 

CC system including Alzheimer's disease, chronic neurodegenerative diseases 

CC including Parkinson's disease, Huntington's chorea, amyotrophic lateral 

CC sclerosis, spinocerebellar degenerations and chronic immunological 

CC diseases of the nervous system or affecting the nervous system, including 

CC multiple sclerosis. This sequence represents the human TGF-beta-1 

CC protein. 

XX 

SQ Sequence 3 90 AA; 

Query Match 89.3%; Score 1936.5; DB 23; Length 390; 
Best Local Similarity 90.2%; Pred. No. 1.6e-161; 

Matches 367; Conservative 10; Mismatches 13; Indels 17; Gaps 1; 



Qy 



1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAI RGQILSKLRLA 60 



Db 


l 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


284 


Qy 


361 


Db 


344 



I llllll MM IMMMIIMIIIM III IM Ml MMIM MMM lllllll 

MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRI EAIRGQILSKLRLA 60 
SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

IMIIhllllMIIIIMMIIIMMIMI III M II M II I II IM M I IM hi 

SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 
YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : Ihll I M 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 H 1 1 

YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAI EGFRLSAHCSCDSKDNTLHVEI NGFN 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I H lllllllllllllhllll hllll 
YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 24 0 

SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASIALDTN 3 00 

: 1 1 1 i ! 1 1 i I ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ; ! i 1 1 llllll Mill 

TGRRGDLATI HGMNRPFLLLMATPLERAQHLQSSRHRR ALDTN 283 

YCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

IMIIIIIIIMMMIIIIIIMMIIIIIIIIIIIIIMIIIIIIIMIIIIIIIIII 

YCFSSTEKNCCWQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 
NQHN PGASAAPCCVPQALE PLP I VYYVGRKPKVEQLSNM I VRS CKCS 4 07 

M II M II i I II M M II II II III MMI Ml Ml M M M II III 

NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 10 
AAE16943 

ID AAE16943 standard; Protein; 390 AA. 
XX 

AC AAE16943; 
XX 

DT 18-APR-2002 (first entry) 
XX 

DE Human transforming growth factor-betal (TGF-betal) protein. 
XX 

KW Human; transforming growth factor-betal; TGF-betal; osteoporosis; 

KW latency associated peptide; LAP; integrin alphavbeta3; apoptosis; 

KW immunomodulation; inflammatory disease; fibrotic disease; cancer; 

KW diabetic retinopathy; chronic obstructive pulmonary disorder; 

KW bone resorption; rheumatoid arthritis; psoriasis; restenosis; 

KW atherosclerosis; liver fibrosis; asthma; cytostatic; osteopathic; 

KW ophthalmological ; ant iarteriosclerotic ; vasotropic . 
XX 

OS Homo sapiens. 



XX 

FH Key Location/Qualifiers 

FT Peptide 1 . .29 

FT /label= Signal__peptide 

FT Region 30 . . 278 

FT /note= "LAP-betal" 

FT Domain 244 . . 246 

FT /note= " RGD motif" 

FT Protein 279 . .390 

FT /note= "Human mature TGF-betal protein" 



XX 

PN WO200190186-A1 . 
XX 

PD 29-NOV-2001. 
XX 

PF 25-MAY-2001; 200 1WO-GB023 52 . 
XX 

PR 26-MAY-2000; 2000GB- 00 12991 . 

PR 05-JAN-2001; 2001GB- 0000286 . 
XX 

PA (GLAX ) GLAXO GROUP LTD. 
XX 

PI Ludbrook S, Barry S, Horgan C, Miller D; 
XX 

DR WPI; 2002-097645/13. 
XX 

PT Identifying modulators of interactions between latency associated 

PT peptides and integrin alphavbeta3 for therapeutics, by contacting the 

PT peptide and integrin with a test product and determining if the product 

PT modulates interaction - 

XX 

PS Disclosure; Page 37-39; 44pp; English. 
XX 

CC The invention relates to a method for identification of a modulator 

CC of the interaction between latency associated peptide (LAP) of 

CC transforming growth factor-betal (TGF-betal) and integrin alphavbeta3 . 

CC The method is useful for identifying a modulator of the interaction 

CC between LAP and integrin alphavbeta3 . The method is useful for 

CC immunomodulation, in the treatment of inflammatory disease, fibrotic 

CC disease, cancer, diabetic retinopathy, bone resorption or osteoporosis, 

CC and for preventing apoptosis administering the modulator to the host. 

CC The modulator (inhibitor of the interaction between LAP-betal and 

CC integrin alphavbeta3) is useful in the manufacture of a medicament for 

CC immunomodulation . The modulator (activator of the interaction between 

CC LAP-betal and integrin alphavbeta3 ) is useful in the manufacture of 

CC medicament for preventing apoptosis. The modulator is useful for 

CC treating a inflammatory or fibrotic disease such as chronic obstructive 

CC pulmonary disorder, rheumatoid arthritis, psoriasis, restenosis, 

CC atherosclerosis, liver fibrosis and asthma. The present sequence is 

CC human TGF-betal protein. 

XX 

SQ Sequence 3 90 AA; 



Query Match 89.3%; Score 1936.5; DB 23; Length 390; 

Best Local Similarity 90.2%; Pred. No. 1.6e-161; 

Matches 367; Conservative 10; Mismatches 13; Indels 17; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAIRGQILSKLRLA 6 0 

I I I I 1 I 1 I :! I i I' I M I II I !! M I I . I I I I I I' I I I II I I I II I I i I . I I I III 
Db 1 MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

II II 1 1 = 1 1 1 II II 1 1 II II I M II I II II II IIIIIIIIMIIIIIIIIIIIh hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 



Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : Ihll I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I : I I I 



Db 



121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 



Qy 



181 YLSNRLLAPSDS PEWLSFDVTGVVRQWLTRREAI EGFRLSAHCSCDSKDNTLHVE I NGFN 240 



Db 



181 YLSNRLLAPS DS PEWLS FDVTGWRQWLSRGGE I EGFRLSAHCSCDSRDNTLQVD I NGFT 24 0 



Qy 



241 SGRRGDLATI HGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 300 



Db 




Qy 



3 01 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 



Db 




Qy 



361 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 407 



Db 




RESULT 11 
ABB82780 

ID ABB82780 standard; Protein; 391 AA. 
XX 

AC ABB82780; 
XX 

DT 18-MAR-2003 (first entry) 
XX 

DE TGFB1 Arg25Pro polymorphism G-allele protein sequence. 
XX 

KW Polymorphism; transforming growth factor beta 1; TGFB1; TGFbeta-1; 

KW renal failure; nephrotropic ; human; allele. 

XX 

OS Homo sapiens. 
XX 

PN WO200290585-A2 . 
XX 

PD 14-NOV-2002. 
XX 

PF 08-MAY-2002; 2 0 02WO-GB02 066 . 
XX 

PR 09-MAY-2001; 2001GB-0011277 . 
XX 

PA (UYSH-) UN IV SHEFFIELD HALLAM . 
XX 

PI El-Nahas AM , Blakemore A, Khalil MS; 
XX 

DR WPI; 2003-120560/11. 
DR N-PSDB; ABV75391. 
XX 

PT Determining an individual's susceptibility to the progression of renal 
PT failure comprises detecting the presence of a genetic polymorphism 
PT pattern in transforming growth factor beta 1 (TGFB1) gene in a sample 
PT from the individual 
XX 

PS Claim 51; Page 59-61; 62pp; English. 
XX 

CC The invention relates to determining an individual's susceptibility to 



CC renal failure and invovles detecting the presence of a genetic 

CC polymorphism pattern in transforming growth factor beta 1 (TGFB1) gene in 

CC a sample from an individual, where polymorphism pattern is associated 

CC with renal failure. The method is useful for determining an individual's 

CC susceptibility to the progression of renal failure. The nucleic acid 

CC comprising a T(-509)C polymorphism of TGFB1 gene, or a polypeptide 

CC comprising a sequence of 391 amino acids is useful for preparing a 

CC medicament for retarding or preventing the progression of renal disease, 

CC and for drug research purposes for retarding or preventing the 

CC progression of renal disease. Sequences ABV75386-88 represents the 

CC protein sequence for the TGFB1 G-allele of the Arg25Pro polymorphism 

CC of exon 1 . 

XX 

SQ Sequence 3 91 AA; 

Query Match 89.3%; Score 1936; DB 24; Length 391; 

Best Local Similarity 90.2%; Pred. No. 1.7e-161; 

Matches 368; Conservative 10; Mismatches 12; Indels 18; Gaps 2; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

MIMIMI MMM II II IMMIM MM llllllllllllllllllllll: hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLL-RLKLKVEQHVELYQKYSNDSW 179 

Mill : I hi I I Mill II II INI III II II II I I MINIM! Ill MINI 

Db 121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRRLKLKVEQHVELYQKYSNNSW 18 0 

Qy 18 0 RYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHCSCDSKDNTLHVEI NGF 23 9 

1 1 1 II I II I MM 1 1 M 1 1 1 1 1 M I Ml M lllllllllllllhllll Ml 

Db 181 RYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGF 24 0 

Qy 24 0 NSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDT 2 99 

M I I I II I I I I I I I I I II I I I II M I I I I I II I I I II I I I I 

Db 241 TTGRRGDLAT I HGMNR PFLLLMATPLERAQHLQSSRHRR ALDT 283 

Qy 300 NYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLAL 359 

I I I I I I I I I I I I I II I II I I I I I I I I I II I I II I I I II I I I I I I II I I II I I II I I I I I I 
Db 284 NYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLAL 343 

Qy 360 YNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 407 

1 1 1 1 1 1 1 ! 1 1 1 1 ) 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 344 YNQHNPGASAAPCCVPQALEPLP I VYYVGRKP KVEQLSNM I VRSCKCS 391 



RESULT 12 
AAR20124 

ID AAR20124 standard; Protein; 390 AA. 
XX 

AC AAR2 0124; 
XX 

DT 25-MAR-2003 (updated) 

DT 16-APR-1992 (first entry) 

XX 



DE Sequence of simian transforming growth factor (TGF) beta-1. 
XX 

KW Hypertension therapy; hypotensive agent; blood pressure modulator. 
XX 

OS Monkey . 
XX 

FH Key 
FT Peptide 
FT Protein 
XX 

PN W09119513-A. 
XX 

PD 26-DEC-1991. 
XX 

PF 20-JUN-1991; 
XX 

PR 20-JUN-1990; 
XX 

PA (BRIM ) BRISTOL-MYERS SQUIBB CO. 
XX 

PI Oleson FB, Comereski CR; 
XX 

DR WPI; 1992-024199/03. 
DR N-PSDB; AAQ20289. 
XX 

PT Use of transforming growth factor (TGF) -beta and their 
PT antagonists - for modulating blood pressure, for treating 
PT hypertension and hypotension 
XX 

PS Disclosure; Fig 1; 42pp; English. 
XX 

CC A new method for treating hypertension comprises administering a 
CC transforming growth factor (TGF) -beta to an individual at a dose 
CC effective for lowering blood pressure; the TGF-beta may be e.g. 
CC mature TGF-beta, TGF-beta2, a mature TGF-betal/beta2 hybrid, TGF- 
CC betal precursor, a latent TGF-beta2 precursor, hybrid TGF-beta 1 /TGF - 
CC beta2 precursor, a latent TGF-betal complex or a latent TGF-beta2 
CC complex. 

CC (Updated on 25-MAR-2003 to correct PA field.) 
XX 

SQ Sequence 3 90 AA; 



Locat ion/Qua 1 i f lers 
8 . .21 
279. .390 



91WO-US04449. 
90US-0541221 . 



Query Match 89.2%; Score 1934.5; DB 13; Length 390; 

Best Local Similarity 89.9%; Pred. No. 2.3e-161; 

Matches 366; Conservative 10; Mismatches 14; Indels 17; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I llllllllllllllllllll 1 1 II 1 1 1' 1 1 1 1 1 II I II I 1 1 1 MINIMI! I 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPSRPAAGLSTCKTI DMELVKRKRI ETI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

I : I I h I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I : hi 
Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

MMI = MMI 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 1 1 1 

Db 121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQH VELYQKYSNNSWR 18 0 



Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 240 

I M 1 1 1 1: 1': ! Ml 1 1 1 1 1 1 1 II 1 1 1 M IIIIIIIIIIIIIIMIII hllll 

Db 181 YLSNRLLAPSNSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSKDNTLQVDINGFT 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 3 00 

: 1 1 1 1 : 1! Ill I II M 1 1 1 II 1 1 1 1 1 1 1 1 1 M II II I Mill 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTN 283 

Qy 301 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

II I M 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 M 1 1 M M I M 1 1 1 1 1 M 1 1 1 M M 1 1 

Db 284 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 

Qy 361 NQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKGS 407 

I I M 1 1 1 1 M I M 1 1 1 II 1 1 II M 1 1 1 1 1 1 II I II 1 1 1 1 1 1 II II M 

Db 344 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 3 90 



RESULT 13 




AAR46227 




1 u 


AAR46227 standard; Protein; 390 AA. 


yy 

-A. A. 








AAR46227; 




yy 

AA 






DT 


25-MAR-2003 


(updated) 


DT 


09-JUL-1994 


(first entry) 


yy 

A A 








Human pre-TGF 


-beta-1 . 


yy 

AA 






KW 


TGF-beta-1; TGF-beta-2; transforming growth factor beta-1; 


KW 


transforming growth factor beta-3; recombinant; wound heal 


KW 


vulnerary. 




XX 






OS 


Homo sapiens. 




XX 






FH 


Key 


Loca t ion/ Qua 1 i f i ers 


FT 


Peptide 


279. .390 


FT 




/label= Mat_peptide 


FT 


Cleavage-site 


279 


FT 




/note= "TGF-beta-1 release site" 


FT 


Modif ied-site 


82. .84 


FT 




/label= N-glycosylation site 


FT 


Modif ied-site 


136. .138 


FT 




/label= N-glycosylation_site 


FT 


Modif ied-site 


176. .178 


FT 




/ label = N-glycosylat ion_site 


XX 






PN 


US5284763-A. 




XX 






PD 


08-FEB-1994 . 




XX 






PF 


04-MAR-1992; 


92US-0845893 . 


XX 






PR 


22-MAR-1985; 


85US-0715142 . 


PR 


13-MAR-1987; 


87US-0025423 . 


PR 


04-AUG-1989; 


89US-0389929 . 


PR 


04-MAR-1992; 


92US-0845893 . 



XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynk RMA, Goeddel DV; 
XX 

DR WPI; 1994-056343/07. 

DR N-PSDB; AAQ56923. 
XX 

PT Nucleic acid sequences encoding transforming growth factor-beta - 

PT diagnostic probes, and for use in therapeutics 

XX 

PS Disclosure; Fig lb; 25pp; English. 
XX 

CC cDNA sequences were determined for human pre-TGF-beta - 1 (AAQ56923) , 

CC pig TGF-beta-3 (AAQ56925) and human TGF-beta-3 (AAQ56926) , and the 

CC corresponding amino acid sequences were determined (AAR46227 -29 , 

CC respectively) . A genomic fragment corresponding to a human TGF- 

CC beta-1 exon (AAQ56924) was also isolated and its amino acid sequence 

CC determined (AAR4 623 0) . The sequences have been used in the 

CC construction of vectors for the expression of recombinant TGF- 

CC beta. 

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 390 AA; 

Query Match 89.0%; Score 1930.5; DB 15; Length 390; 

Best Local Similarity 89.9%; Pred. No. 5.2e-161; 

Matches 366; Conservative 10; Mismatches 14; Indels 17; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I ll II I I Ml '. I I ' I I I I I I I I ' Mlllllllllllllllllllllllllllllllll 
Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 S PPSQGDVPPGPLP EA VLALYNSTRDRVAGES VEPE P E P EADYYAKEVTR VLM VESGNQ I 12 0 

I llllh III II II II I II III II MM II II I Mill MM I Ml I Ml MM Ml 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEREPEPEADYYAKEVTRVLMVETHNEI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : IMM I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I = I I I 
Db 121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 24 0 

1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 II I II h I 1 1 1 1 1 1 1 1 1 1 1 II M M 1 1 Mill I 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 300 

: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MIMI IMM 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTN 283 

Qy 3 01 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 36 0 

I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 

Db 284 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 

Qy 361 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 407 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 344 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 14 
AAR83054 

ID AAR83054 standard; Protein; 391 AA. 
XX 

AC AAR83 054; 
XX 

DT 25-JUN-1996 (first entry) 
XX 

DE Transforming growth factor-beta 1. 
XX 

KW macrophage inducible nitric oxide synthase; iNOS; constitutive NOS; 

KW interleukin-l-beta; transforming growth factor-beta; TGF-beta; ILl-beta; 

KW nitric oxide production; hypotension; inflammation; septic shock; 

KW treatment. 

XX 

OS Mammalian sp . 
XX 

FH Key Location/Qualifiers 
FT Protein 279. .391 

FT /note= "represents the mature active TGF beta-1 mol . " 

XX 

PN W09526745-A1 . 
XX 

PD 12-OCT-1995. 
XX 

PF 05-APR-1994; 94WO-US03 7 05 . 
XX 

PR 05-APR-1994; 94WO-US03 7 05 . 
XX 

PA (HARD ) HARVARD COLLEGE. 
XX 

PI Lee M, Perrella MA; 
XX 

DR WPI; 1995-358443/46. 
DR N-PSDB; AAT05876. 
XX 

PT Treatment of hypotension, esp. in septic shock - by administering 
PT transforming growth factor-beta e.g. to inhibit inducible nitric 
PT oxide synthase gene transcription 
XX 

PS Disclosure; Fig 17; 52pp; English. 
XX 

CC Transforming growth factor-beta 1 (TGF-beta 1) has been found to inhibit 

CC inducible nitric oxide synthase (iNOS) gene transcription, esp. in 

CC interleukin-l-beta (ILl-beta) stimulated rat smooth muscle cells, and at 

CC a dose which does not inhibit consitutive NOS. TGF-beta 1 or 2 

CC (AAR83055) or their active fragments (esp. derived from the 

CC carboxy- terminal 112 amino acids) , can be used in the treatment of 

CC hypotension, such as that associated with severe inflammation or septic 

CC shock. 

XX 

SQ Sequence 3 91 AA; 



Query Match 88.9%; Score 1929; DB 16; Length 391; 

Best Local Similarity 90.0%; Pred. No. 7.1e-161; 

Matches 367; Conservative 10; Mismatches 13; Indels 18; Gaps 2; 



Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKT I DMELVKRKRI EAI RGQI LSKLRLA 60 

I IIIIIIIIIIIMIIIIIIIII 1 1 1 1 1 1 1 1 1 II I II 1 1 ! 1 1 1 1 ! I M I M 1 1: II 

Db 1 MP PSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

1 1 1 1 1 h 1 1 1 1 1 1 1 1 1 1 : 1 M 1 1 1 M M 1 1 1 Ml MM Ml MM M III llh Ml 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLL-RLKLKVEQHVELYQKYSNDSW 179 

MMI : MM I MMMMMIMMMMMMI MM MM MM MM MM I 

Db 121 YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRRLKLKVEQHVELYQKYSNNSW 180 

Qy 180 RYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGF 23 9 

IMMMMMMMMMMMMMMM MMMMMMMMMI MMM 

Db 181 RYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGF 240 

Qy 240 NSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDT 299 

MMMMMMMMMMIMMMMM MMM MM 

Db 241 TTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDT 283 

Qy 3 00 NYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLAL 359 

M MMM MMM M MM M MM M M MM MMM MMM MMM M MMM 

Db 284 NYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLAL 343 

Qy 360 YNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 4 07 

M M M M M M M M M M M M M M M M M M M M M M M M 

Db 344 YNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 391 



RESULT 15 




AAW7I 


3785 




ID 


AAW78785 standard; Protein; 390 AA. 


XX 






AC 


AAW78785; 




XX 






DT 


25-MAR-2003 


(updated) 


DT 


21-DEC-1998 


(first entry) 


XX 






DE 


Human pre-transf orming growth factor-beta 1. 


XX 






KW 


Transforming growth factor-beta 1; TGF-beta 1; human 


XX 






OS 


Homo sapiens . 




XX 






FH 


Key 


Locat ion/Qua 1 if iers 


FT 


Domain 


8 . .23 


FT 




/note= "hydrophobic domain" 


FT 


Protein 


279. .390 


FT 




/label= Mat_protein 


FT 


Modif ied-site 


82.. 84 


FT 




/note- "Asn is N-glycosylated" 


FT 


Modif ied-site 


136. .138 


FT 




/note= "Asn is N-glycosylated" 


FT 


Modif ied-site 


176. . 178 


FT 




/note= "Asn is N-glycosylated" 


FT 


Cleavage-site 


277. .278 



FT /note* "cleavage site for relase of TGF-beta 1" 
XX 

PN US5801231-A. 
XX 

PD 01-SEP-1998. 
XX 

PF 30-MAY-1995; 95US- 0454468 . 
XX 

PR 13-MAR-1987; 87US- 0025423 . 

PR 22-MAR-1985; 85US - 07 15142 . 

PR 04-AUG-1989; 89US-0389929 . 

PR 04-MAR-1992; 92US- 0845893 . 

PR 05-NOV-1993; 93US - 0 14 7 3 64 . 

PR 30-MAY-1995; 95US- 04 54468 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI ; 1998-494840/42. 

DR N-PSDB; AAV52933. 
XX 

PT DNA encoding transforming growth factor-beta precursor sequence - 

PT useful for analysis to perform manipulations to increase yield of 

PT recombinant production of the protein 
XX 

PS Example 3; Fig IB 1-3; 2 6pp; English. 
XX 

CC This is the amino acid sequence of human transforming growth 

CC factor-beta 1 precursor (preTGF-beta 1) . It was deduced from 

CC a preTGF-beta 1 cDNA sequence (see AAV52 933) . The invention relates 

CC to the recombinant production of TGF-beta. Biologically active 

CC TGF-beta is defined as being capable of inducing EGF-potentiated 

CC anchorage independent growth of target cell lines and/or growth 

CC inhibition of neoplastic cell lines. Nucleic acids encoding 

CC TGF-beta have been isolated and cloned into vectors which are 

CC replicated in bacteria and expressed in eukaryotic cells. TGF-beta 

CC recovered from transformed cells is used in known therapeutic 

CC applications. 

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 390 AA; 

Query Match 88.9%; Score 1928.5; DB 19; Length 390; 
Best Local Similarity 89.7%; Pred. No. 7.9e-161; 

Matches 365; Conservative 10; Mismatches 15; Indels 17; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I II MM II II III Mill III I I I II I III Ml II MINIMUM Mill II I 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAPGLSTCKTIDMEQVKRKRIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

II 1 1 M M 1 1 1 M II II II I II M 1 1 1 M I II I II 1 1 1 1 M I II 1 1 1 1 1 II 1 1 M Ml 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : IMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MINIM 1 1 1 1 ' 1 1 ; 1 1 1 1 ! 1 1 1 1 ^ 1 1 1 



Db 



121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 



Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I h I I I I I I I I I I I I I I I = I I I I I : I I I I 
Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 3 00 

: I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I II I I I 

Db 241 TGRRGDLATIHGMNRPFLLLiMATPLERAQHLQSSRHRR ALDTN 283 

Qy 301 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

! 1 1 1 1 1 1 1 1 1 . I ! : ' 1 1 . 1 1 1 I ' I > I 1 1 1 1 1 1 1 1 1 ! I 1 1 ! 1 1 1 . 1 1 1 1 1 ! I ; I 

Db 284 YCFSSTEKNCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 

Qy 361 NQHNPGASAAPCCVPQALEPLP I VYY VGRKPKVEQLSNM I VRSCKCS 4 07 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIII 
Db 344 NQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



Search completed: October 28, 2003, 09:06:49 
Job time : 34,9534 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein 
Run on: 



Title : 

Perfect score: 
Sequence : 

Scoring table: 



protein search, using sw model 

October 28, 2003, 09:09:54 ; Search time 24.6934 Seconds 

(without alignments) 
2760.110 Million cell updates/sec 

US-10-017-372E-39 
2169 

1 MAPSGLRLLPLLLPLLWLLV GRKPKVEQLSNM I VRSCKCS 4 07 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched : 



629382 seqs, 167460630 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



629382 



Database 



Publ ished_Appl icat ions_AA : * 

1 : /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB.pep: * 

2 : / cgn2_6 /p t oda t a / 2 /pubpa a / PCT_NEW__PUB . pep : * 

3 : /cgn2_6/ptodata/2/pubpaa/US06__NEW_PUB.pep: * 
4 : /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep: * 
5 : /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB . pep : * 
6 : /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB . pep : * 



7 : /cgn2_6/ptodata/2 /pubpaa/US08_NEW_PUB . pep : * 

8 : /cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB.pep: * 

9 : /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB .pep: * 

10 : /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep: * 

11 : /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep: * 

12 : /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB .pep: * 

13 : /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep: * 

14 : /cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep: * 

15 : /cgn2_6/ptodata/2/pubpaa/US10C_PUBCOMB.pep: * 

16: / cgn2_6 /ptoda t a / 2 /pubpaa /US 1 0_NEW_PUB . pep : * 

17: / cgn2_6 /ptoda t a / 2 /pubpaa /US 6 0_NEW_PUB . pep : * 

18 : /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB .pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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RESULT 1 

US-09-214-592-26 

Sequence 26, Application US/09214592A 
Publication No. US20030027218A1 
GENERAL INFORMATION : 
APPLICANT : Yamasaki , CMotoo 
APPLICANT: Shibata, CKenj i 
APPLICANT: Sato, CYasufumi 

TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
FILE REFERENCE: 11060 

CURRENT APPLICATION NUMBER: US/09/2 14 , 592A 
CURRENT FILING DATE: 1999-01-18 
NUMBER OF SEQ ID NOS : 34 
SOFTWARE : 
SEQ ID NO 26 
LENGTH: 3 90 
TYPE : PRT 
ORGANISM: porcine 
US-09-214-592-26 

Query Match 94.5%; Score 2050.5; DB 11; Length 390; 

Best Local Similarity 95.3%; Pred . No. 7.2e-183; 

Matches 388; Conservative 1; Mismatches 1; Indels 17; Gaps 1, 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I I MM l lllll MM I II I II MM MIIMI II Ml MIMIIMM lIMIIMI 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I II 1 1 1 M Ml II 1 1 1 1 M M 1 1 1 1 MM 1 1 II 1 1 ll 1 1 M M 1 1 1 III 1 1 Ml 1 1 1 1 1 

Db 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMLESGNQI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

II llllll II llllll M III MIMI I IMIIMI Ml MIMIIMI IMMMIM 

Db 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 24 0 

'MMMMIMM I MM IMIMIII MM IM M !M ' 1 1 1 M M M MM MMI 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 240 



Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 3 00 

MIIIIIIIIIIIIMIMIMIIIIIIMIIMMII I M II 

Db 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRR ALDTN 283 

Qy 3 01 YCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVLALY 3 60 

IMIIIMIIIMIIIIIIIIIIIIIIMIMIMMMIIIIMMMIIIIIIMIII 

Db 284 YCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVLALY 343 

Qy 361 NQHNPGASAAPCCVPQALE PLP I VY YVGRKPKVEQLSNMI VRSCKCS 4 07 

1 1 M 1 1 S 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 ( ! M ! M 1 M 1 1 1 

Db 344 NQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNM I VRSCKCS 3 90 



RESULT 2 

US-09-214-592-29 

Sequence 29, Application US/09214592A 
Publication No. US20030027218A1 
GENERAL INFORMATION: 
APPLICANT: Yamasaki , CMotoo 
APPLICANT: Shibata , CKenj i 
APPLICANT: Sato,CYasufumi 

TITLE OF INVENTION : PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY-REGULATING COMPOUNDS 
FILE REFERENCE: 11060 

CURRENT APPLICATION NUMBER: US/09/2 14 , 592A 
CURRENT FILING DATE: 1999-01-18 
NUMBER OF SEQ ID NOS : 34 
SOFTWARE : 
SEQ ID NO 2 9 
LENGTH: 3 90 
TYPE: PRT 
ORGANISM: ovine 
US-09-214-592-29 

Query Match 89.9%; Score 1949.5; DB 11; Length 390; 

Best Local Similarity 90.2%; Pred. No. 1.9e-173; 

Matches 367; Conservative 10; Mismatches 13; Indels 17; Gaps 

MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 6 

I IIIIIIIMIIIIIIIhllllll 1 1 1 1 M II I II 1 1 1 1 1 IIIIIIMIIIIIII 

MP PSGLRLLPLLL PLLWLLMLTPGR P VAGLSTCKT I DMEL VKRKG I EAI RGQ I LS KLRLA 6 

S P PSQGDVPPGPLP EAVLALYNSTRDR VAGES VE PE PE P EADY YAKE VTR VLMVESGNQ I 1 

lllllllllllllllhlllilllllllllll I I I II I I I I M 'I II I I I , I ||:| 
SPPSQGDVPPGPLPEAI LALYNSTRDR VAGES AETEPEPEADYYAKE VTR VLMVE YGNKI 1 

YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 1 

Mi i : Mil iiiiiilililllllllli-liiiiiiiliiiliiiiiMhiii 



Qy 


i 


Db 


i 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 



II 



Db 



241 



SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRR 



ALDTN 28 3 



Qy 301 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

IIMIIIIMIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMI 

Db 284 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 

Qy 361 NQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 4 07 

' ; 1 1 1 !i i, i ii: 1 1 m ii, 1 1 1 1 1 1 1 1 ii i 1 ii 1 1 n i ii i ! 1 1 , i 

Db 344 NQHN PGASAAPCCVPQALE PLP I VY Y VGRKPKVEQLSNM I VRS CKCS 390 



RESULT 3 
US-10-087-268-2 

; Sequence 2, Application US/10087268 

; Publication No. US20030119010A1 

; GENERAL INFORMATION: 

; APPLICANT: Jonsonn, Julie Ruth 

; APPLICANT: Powell, Elizabeth Ellen 

; TITLE OF INVENTION: Polypeptides and polynucleotides linked to a disease or 
condition 

; FILE REFERENCE: Fibrosis 

; CURRENT APPLICATION NUMBER : US/10/087,268 
; CURRENT FILING DATE: 2002-03-01 
; NUMBER OF SEQ ID NOS : 6 

SOFTWARE: Patentln version 3.1 
/ SEQ ID NO 2 

LENGTH: 3 90 

TYPE: PRT 

ORGANISM: Human 
US-10-087-268-2 



Query Match 8 9.7%; Score 1946.5/ DB 15; Length 390; 

Best Local Similarity 90.4%; Pred. No. 3.7e-173; 

Matches 368; Conservative 10; Mismatches 12; Indels 17; Gaps 1; 



Qy 


i 


MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 

1 1 1 II ! 1 1 1 II 1 1 M M 1 ! M 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 M 1! 1 1 ! 1 M 1 

MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 


60 


Db 


i 


60 


Qy 


61 


SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 

MlllhllllMIIMIIIIIIIIMIIIII lllllllllllllllllllllh hi 

S P PSQGEVP PGPLP EAVLALYNSTRDRVAGESAEP EPE P EADYYAKE VTRVLMVETHNE I 


120 


Db 


61 


120 


Qy 


121 


YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 

III II ■■ Ihll II II Ml Ml II Mill IN Ml MINIMI MM Mill hill 


180 


Db 


121 


YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRLKXiKVEQHVELYQKYSNNSWR 


180 


Qy 


181 


YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 

i M i i i i M i M M II 1 1 II 1 1 M 1 M h 1 1 1 1 1 1 1 II 1 1 1 1 1 h 1 1 1 1 h 1 1 1 1 

YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 


240 


Db 


181 


240 


Qy 


241 


SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 


300 


Db 


241 


Mi 11 Ml 1 1 1 1 1 M 1 1 i Mi Ii; 1 Ml 1 III llllll MM) 

TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTN 


283 


Qy 


301 


YCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 


360 



IIMMIMMIIMMMMIIIII IIIMIIIIIIMMI MIIMIMMIIIMI 



Db 284 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 

Qy 361 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 4 07 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I M I 
Db 344 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 4 
US-10-087-268-5 

; Sequence 5, Application US/10087268 

; Publication No. US20030119010A1 

; GENERAL INFORMATION: 

; APPLICANT: Jonsonn, Julie Ruth 

; APPLICANT: Powell, Elizabeth Ellen 

TITLE OF INVENTION : Polypeptides and polynucleotides linked to a disease or 
condition 

; FILE REFERENCE: Fibrosis 

; CURRENT APPLICATION NUMBER : US/ 1 0/ 087 , 268 
; CURRENT FILING DATE: 2002-03-01 
; NUMBER OF SEQ ID NOS : 6 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 5 

LENGTH: 390 

TYPE: PRT 

ORGANISM : Human 
US-10-087-268-5 



Query Match 89.4%; Score 1939.5; DB 15; Length 390; 

Best Local Similarity 90.2%; Pred . No. 1.7e-172; 

Matches 367; Conservative 10; Mismatches 13; Indels 17; Gaps 1; 



Qy 


i 


MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQILSKLRLA 


60 


Db 


i 


1 llllllllllllllllllllll lllllllllllllllllllllllllllllllllll 
MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTI DMELVKRKRI EAI RGQILSKLRLA 


60 


Qy 


61 


SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 

1 III Ih 1 'll hi 1 1 1 1 1 1 1 1 Ml 1 1 Ml 1 ll III 1 1 1 1 ll I'L'M 1 II 1 : hi 
SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 


120 


Db 


61 


120 


Qy 


121 


YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 

Mill : Ihll IIIIMIIIIIIIIIIIMIIIIIIIMIIIMIIIMMIhlll 


180 


Db 


121 


YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 


180 


Qy 


181 


YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 

MIIIIMIIMIIIMIIIIIIIIIIhl IIMIIIIIIMIhllll hllll 

YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 


240 


Db 


181 


240 


Qy 


241 


SGRRGDIATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYnVPDYASLALDTN 


300 


Db 


241 


mmmmmimimmmmmimimi iiiiii inn 

TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTN 


283 


Qy 


301 


YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 


360 


Db 


284 


MIMIMIIMIIIMMIIIMIIMIIIIIIMIIIIIIIIMIIIMIIMIIMI 

YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS KVLALY 


343 


Qy 


361 


NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 4 07 





lllllllllllllllllllll llllll llllllllllllllllllll 



Db 344 NQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 5 
US-10-276-947-1 

; Sequence 1, Application US/10276947 

; Publication No. US20030176315A1 

; GENERAL INFORMATION: 

; APPLICANT: GLAXO GROUP LIMITED 

; TITLE OF INVENTION: MEDICAL PRODUCTS 

; FILE REFERENCE: PG3 94 9 

; CURRENT APPLICATION NUMBER: US/10/276,947 

; CURRENT FILING DATE: 2002-11-21 

; PRIOR APPLICATION NUMBER: GB 0012991.6 

; PRIOR FILING DATE: 2000-05-26 

; PRIOR APPLICATION NUMBER: GB 0100286.4 

; PRIOR FILING DATE: 2001-01-05 

; NUMBER OF SEQ ID NOS : 1 

; SOFTWARE: Patent In Ver . 2.0 

; SEQ ID NO 1 

LENGTH: 3 90 

TYPE : PRT 
; ORGANISM: Homo sapiens 
US-10-276-947-1 

Query Match 89.3%; Score 1936.5; DB 12; Length 390; 

Best Local Similarity 90.2%; Pred. No. 3.2e-172; 

Matches 367; Conservative 10; Mismatches 13; Indels 17; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRI EAIRGQILSKLRLA 60 

I Mill! 1 I II II 1 1 M I M 1 1 II II 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 

Db 1 MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKR I EAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

MIMIMIMIMMMMMMMMIMI lllllllllllllllllllllh hi 
Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill = I h It IIIMIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIhlll 

Db 121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEVJLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 240 

M I M 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 M I • I IMMIMIMIIhMII hMII 

Db 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEI EGFRLSAHCSCDSRDNTLQVDINGFT 24 0 

Qy 241 SGRRGDLATI HGMNR PFLLLMATPLERAQHLHSSRHRRALDTNS YPYDVPDYASLALDTN 300 

= 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MINI Mill 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLOSSRHRR ALDTN 283 

Qy 3 01 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

I I M M II 1 1 1 1 1 M 1 1 M 1 1 1 1 M M I M M M I M I M M I M II I M 1 1 II II M 1 1 

Db 284 YCFSSTEKNCCVRQLYIDFRJCDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 

Qy 3 61 NQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 4 07 

1 1 1 1 ! 1 1 1 1 ! 1 1 1 1 1 i , 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 ! ' i 

Db 344 NQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 6 

US-09-214-592-33 

; Sequence 33, Application US/09214592A 
; Publication No. US20030027218A1 
; GENERAL INFORMATION : 

APPLICANT: Yamasaki , CMotoo 
; APPLICANT: Shibata , CKenj i 
; APPLICANT: Sa to , CYasuf umi 

; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

; TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
; FILE REFERENCE: 11060 

; CURRENT APPLICATION NUMBER: US/ 0 9/2 14 , 5 92A 
; CURRENT FILING DATE: 1999-01-18 
; NUMBER OF SEQ ID NOS : 34 

SOFTWARE : 
; SEQ ID NO 33 

LENGTH: 3 90 

TYPE : PRT 

ORGANISM: simian 
US-09-214-592-33 



Query Match 89.2%; Score 1934.5; DB 11; Length 390; 

Best Local Similarity 89.9%; Pred. No. 4.9e-172; 

Matches 366; Conservative 10; Mismatches 14; Indels 17; Gaps 1; 



Qy 


l 


MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 


60 


Db 


l 


1 lllllllllllllllllllll 1 M 1 '1 1 1 II ' .'I ll ' III III llllllllllll 
MPPSGLRLLPLLLPLLWLLVTjTPSRPAAGLSTCKTIDMELVKRKR I ETI RGQI LSKLRLA 


60 


Qy 


61 


SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 


120 


Db 


61 


lllllhllllllllllllllllllllllM IIIIIIIIIIMIIIIIIIMh hi 

SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 


120 


Qy 


121 


YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 

Mill = Ihll IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIhlll 


180 


Db 


121 


YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 


180 


Qy 


181 


YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 

lllllllllhllllllllllllMMhl IMIIMIIIIIMIIIM hllll 

YLSNRLLAPSNSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSKDNTLQVDINGFT 


240 


Db 


181 


240 


Qy 


241 


SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 

MIMMIIIIIIIIIIIIIIIIIIIIIMI llllll INN 


300 


Db 


241 


TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTN 


283 


Qy 


301 


YCFSSTEKNCCV11QLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYT WSLDTQYSKArLALY 

1 1 1 M ! 1 1 II 1 1 ! 1 1 1 1 ! 1 M 1 ! I (1 1 ! 1 1 1 1 1 ! I (1 1 M ! 1 1 ! II M M 1 1 (1 M II ! 1 

YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 


360 


Db 


284 


343 


Qy 


361 


NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 4 07 

M 1 1 1 1 '1 1 1 1 1 1, 1 1 1 1 III 1 III II! '1 1 1 1 II 1 1 M i II 1 1 II 

NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 90 




Db 


344 





RESULT 7 



US-09-214-592-17 

; Sequence 17, Application US/09214592A 

; Publication No. US20030027218A1 

; GENERAL INFORMATION: 

; APPLICANT : Yamasaki , CMotoo 

; APPLICANT: Shibata , CKenj i 

; APPLICANT: Sato , CYasuf umi 

; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

; TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
; FILE REFERENCE: 11060 

; CURRENT APPLICATION NUMBER: US/09/214 , 592A 
; CURRENT FILING DATE: 1999-01-18 
; NUMBER OF SEQ ID NOS : 34 
; SOFTWARE : 
; SEQ ID NO 17 

LENGTH: 3 91 

TYPE : PRT 

ORGANISM: human 
US-09-214-592-17 



Query Match 88.9%; Score 1929; DB 11; Length 391; 

Best Local Similarity 90.0%; Pred . No. 1.6e-171; 

Matches 367; Conservative 10; Mismatches 13; Indels 18; Gaps 2; 



Qy 


1 


MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 

1 llllllllllllllllllllll IMIIIIMIIIIIIIIIIIMIMIIIIIIIIII 

MP PSGLRLLPLLLPLLWLL VT/TPGP PAAGLSTCKT I DMELVKRKR I EA I RGQ I LS KLRLA 


60 


Db 


1 


60 


Qy 


61 


SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 

Mill M Mill 1 1 1 III III 1 II MM M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 = hi 


120 


Db 


61 


SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 


120 


Qy 


121 


YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLL-RLKLKVEQHVELYQKYSNDSW 

Mill : 1 i : 1 1 MIMIIMMIMMIMMIM 1 1 1 II 1 1 M I M 1 1 1 1 1 h II 

YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRRLKLKVEQHVELYQKYSNNSW 


179 


Db 


121 


180 


Qy 


180 


RYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGF 


239 


Db 


181 


IMIMIIMIIIMIIIMMIIMIIhl IMIIIIIMIIIhllll MINI 

RYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGF 


240 


Qy 


240 


NSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDT 

: II 1 1 :i 1 II 1 1 1 MM 1 h 1 II Ml ; 1 1 1 mill MM 

TTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDT 


299 


Db 


241 


283 


Qy 


300 


NYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLAL 

IIMMMMMMMMMMMMMMMMMIMMIIMMMMMMMIM 


359 


Db 


284 


NYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLAL 


343 


Qy 


360 


YNQHN PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VUSCKCS 4 07 

I M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 [ 1 1 1 ! 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 M 1 1 1 1 

YNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 91 




Db 


344 





RESULT 8 

US-09-756-283A-23 

; Sequence 23, Application US/09756283A 



Patent No. US2002 0151478A1 
GENERAL INFORMATION: 
APPLICANT: Cherna j ovsky , Yuti 
APPLICANT: Dreja, Hanna St ma 
APPLICANT: Adams, Gillian 
TITLE OF INVENTION : Latent Fusion Protein 
FILE REFERENCE: 0623.1000000 
CURRENT APPLICATION NUMBER: US/09/756 , 283A 
CURRENT FILING DATE: 2001-01-09 
NUMBER OF SEQ ID NOS : 100 
SOFTWARE: Patent In version 3.0 
SEQ ID NO 23 
LENGTH: 3 90 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-756-283A-23 

Query Match 88.8%; Score 1925.5; DB 10; Length 390; 

Best Local Similarity 89.7%; Pred. No. 3.4e-171 ; 

Matches 365; Conservative 10; Mismatches 15; Indels 17; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLT PGRPAAGLSTCKT I DMELVKRKR I EA I RGQ I LS KLRLA 60 

I II II 1 1 M I II M II M 1 1 1 M 1 1 II 1 1 , 1 II M I II 1 1 I I I II I I I I I I I I II I II 

Db 1 MP PSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKT I DMELVKRKR I EAI RGQ I LS KLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

1 1 1 1 1 h 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 h 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHHEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : II II I I ' I I 1 I Ml I !' I II I I II I' I I I J I II I ll I I I I I I I h I I 
Db 121 YDKFKQSTHSTYMFFNISELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDS P EWLS FDVTG WRQWLTRREA I EGFRLSAHCS CDS KDNTLH VE I NGFN 24 0 

Ml II 11 MM MIMM MM I M III MM MIMMMIMIhlMI hllll 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 300 

^ , i ! ; 1 1 , i ; 1 1 1 1 1 ' 1 1 1 1 1 1 1 1 1 1 1 i mi inn 

Db 241 TGRRGDLAT I HGMNRP FLLLMAT PLERAQHLQSSRHRR ALDTN 283 

Qy 3 01 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

II II I I II II I II I II II I II II II II II I I II II II II II I I II I II II I II II II II I 

Db 2 84 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 34 3 

Qy 361 NQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 4 07 

Mill MM MM Ml Mill Mill Ml II II MM MM II MM 

Db 344 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 9 

US-09-214-592-28 

; Sequence 28, Application US/09214592A 

; Publication No. US20030027218A1 

; GENERAL INFORMATION: 

; APPLICANT: Yamasaki , CMotoo 

; APPLICANT: Shiba ta , CKen j i 



; APPLICANT: Sato , CYasuf umi 

; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
FILE REFERENCE: 11060 

CURRENT APPLICATION NUMBER : US/09/214 , 592A 
CURRENT FILING DATE: 1999-01-18 
NUMBER OF SEQ ID NOS : 34 
SOFTWARE : 
SEQ ID NO 28 
LENGTH: 390 
TYPE: PRT 
ORGANISM: canine 
US-09-214-592-28 

Query Match 88.2%; Score 1913.5; DB 11; Length 3 90; 

Best Local Similarity 89.2%; Pred. No. 4.5e-170; 

Matches 363; Conservative 12; Mismatches 15; Indels 17; Gaps 1; 

MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I 1 1 II I ill 1 1 1 1 1 1 1 1 1 1 1 1 1 M Mill 1 1 ! 1 1 1 1 1 1 1 1 1 1 M 1 IN II 1 1 1 1 1 ! h 

MPPSGLRLLPLLLPLLRLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLS 6 0 
SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

MlllhIM lllllllllllllllllllll lllllllllllllllllllllh hi 

SPPSQGEVPPVPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVENTNKI 12 0 

YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

hi I :||hlllllMIIIIIIIIIIIIIIIIIIIIIIIII I I I I I I t I 1 1 I I I I I I 
YEKVKKS PHS I YMLFNTSELREAVPEPVLLSRAELRLLRLKLKAEQHVELYQKYSNDSWR 18 0 

YLSNRLLAPS DS PEWLS FDVTGWRQWLTRREA I EGFRLSAHCS CDS KDNTLHVE I NGFN 24 0 

I I I ,1 I I ! II h I I M I I I I I I I I II .h :|lllllll IMIIMII |:||||: 

YLSNRLLAPSDTPEWLSFDVTGWRQWLSHGGEVEGFRLSAHCSCDSKDNTLQVDINGFS 24 0 

SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 3 00 

I : h I I I II I I ll I II II ; Ml I I ! I M I II I I I , II Mill 
SSRRGDLATI HGMNRPFLLLMATPLERAQHLHSSRQRR ALDTN 283 

YCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVLALY 360 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1! 1 1 1 1 1 

YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 34 3 
NQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 4 07 

I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

NQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 90 



Qy 


i 


Db 


i 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


284 


Qy 


361 


Db 


344 



RESULT 10 
US-09-214-592-20 

; Sequence 20, Application US/09214592A 

; Publication No. US2003 00272 18A1 

; GENERAL INFORMATION: 

; APPLICANT: Yamasaki , CMotoo 

APPLICANT: Shibata , CKenj l 
; APPLICANT: Sato , CYasuf umi 



; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

; TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
; FILE REFERENCE: 11060 

; CURRENT APPLICATION NUMBER: US/09/2 14 , 592A 
; CURRENT FILING DATE: 1999-01-18 
; NUMBER OF SEQ ID NOS : 34 
; SOFTWARE : 
; SEQ ID NO 2 0 

LENGTH: 390 

TYPE : PRT 

ORGANISM: murine 
US-09-214-592-20 

Query Match 85.5%; Score 1855.5; DB 11; Length 390; 

Best Local Similarity 85.5%; Pred. No. 1.2e-164; 

Matches 348; Conservative 15; Mismatches 27; Indels 17; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I lllllllllllll I II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 M I M I II M 1 1 1 II 1 1 1 1 1 II 1 1 II 

Db 1 MPPSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I I I I ' |: I I I I I I I I I II I II I I I I I L I II ' : I I ! I 1 I I ' I I I . I I M ' M I ^ II 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

hi I i nil i Ill-Ill MM III I II I II III Mill III II llhl M 

Db 121 YEKTKDISHSI YMFFNTSDIREAVPEPPLLSRAELRLQRLKSSVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 24 0 

II I II I h hi 1 1 III I MM INN = = hill II I Mill II I lllllll 

Db 181 YLGNRLLTPTDTPEWLSFDVTGWRQWLNQGDGI OGFRFSAHCSCDSKDNKLHVEINGIS 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 3 00 

Mill III 1 I I I t I I I I I I I 1 I I I I I I 1 I i I I I 1 Mill 
Db 241 PKRRGDLGTI HDMNRPFLLLMATPLERAQHLHSSRHRR ALDTN 283 

Qy 3 01 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

II II I II II II I I I I I I II I I I II I M II II II I II M I I I II I I II II M I I I I II II I 
Db 284 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 

Qy 361 NQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 4 07 

[ 1 1 1 [ 1 1 1 1 = I ( 1 1 ( 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 

Db 344 NQHN PGASAS PCCVPQALE PLP I VY YVGRKPKVEQLSNM I VRSCKCS 390 



RESULT 11 
US-09-214-592-23 

; Sequence 23, Application US/09214592A 

; Publication No. US20030027218A1 

; GENERAL INFORMATION: 

; APPLICANT: Yamasaki , CMotoo 

; APPLICANT: Shibata , CKenj i 

; APPLICANT : Sa t o , CYasuf umi 

TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 



TITLE OF INVENTION : OF SCREENING TGF- ACT I VI TY - REGULAT I NG COMPOUNDS 
FILE REFERENCE: 11060 

CURRENT APPLICATION NUMBER : US/09/2 14 , 592A 
CURRENT FILING DATE: 1999-01-18 
NUMBER OF SEQ ID NOS : 34 
SOFTWARE : 
SEQ ID NO 23 
LENGTH: 3 90 
TYPE : PRT 
ORGANISM: rat 
US-09-214-592-23 

Query Match 85.5%; Score 1855.5; DB 11; Length 390; 

Best Local Similarity 85.5%; Pred . No. 1.2e-164; 

Matches 348; Conservative 14; Mismatches 28; Indels 17; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I I I II I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I II I I I I I I I II I I I I I I I I I 
Db 1 MPPSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKTIDMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I M 1 1 hi ! 1 1 II II . I I I ! 1 1 II 1 1 1 1 1! I UN M I III I II I II II i M I I 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

III I Ihll lllhhllllll lllllllll I I IIIIIIMIIIIhlll 
Db 121 YDKTKDITHSI YMFFNTSDIREAVPEPPLLSRAELRLQRFKSTVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 240 

II MM hhlMMMMMIMM : : hill I I ! I I I I I 1 I I lllllll 

Db 181 YLGNRLLTPTDTPEWLS FDVTG WRQWLNQGDG I QGFRFSAHCS CDS KDN VLHVE I NG I S 240 

Qy 241 SGRRGDLAT I HGMNR P FLLLMATPLERAQHLHS SRHRRALDTNS Y P YDVPDYAS LALDTN 300 

Mill III IIIIMIIIIIIMIMIIIMIIM Mill 

Db 241 PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRR ALDTN 2 83 

Qy 301 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

Ml II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II Mill MM Mill I II III lllllllllll 

Db 2 84 YCFSSTEKNCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 34 3 

Qy 361 NQHNPGASAAPCCVPQALEPLP I VY YVGRKPKVEQLSNMI VRSCKCS 4 07 

Ml 1 1 MM : ! 1 1 1 1 1 h 1 1 1 1 1 h 1 1 h II M M 1 1 1 II 1 1 1 M M 

Db 344 NQHNPGASASPCCVPQALEPLP IVY YVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 12 
US-09-214-592-25 

; Sequence 25, Application US/09214592A 
; Publication No. US20030027218A1 
; GENERAL INFORMATION: 

APPLICANT: Yamasaki , CMotoo 
; APPLICANT: Shiba ta , CKen j i 
; APPLICANT: Sa to , CYasuf umi 

; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

; TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
; FILE REFERENCE: 11060 



CURRENT APPLICATION NUMBER : US/09/2 14 , 5 92A 
CURRENT FILING DATE: 1999-01-18 
NUMBER OF SEQ ID NOS : 3 4 
SOFTWARE : 
SEQ ID NO 25 
LENGTH: 315 
TYPE: PRT 
ORGANISM: bovine 
US-09-214-592-25 

Query Match 73.4%; Score 1592.5; DB 11; Length 315; 

Best Local Similarity 89.5%; Pred. No. 3.1e-140; 

Matches 297; Conservative 9; Mismatches 9; Indels 17; Gaps 1; 
Qy 76 AVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLF 135 

h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I III hi; Ml II I Ihllll I : Ihl I 

Db 1 AILALYNSTRDRVAGESAETEPEPEADYYAKEVTRVLMVEYGNKI YDKMKSSSHSI YMFF 60 

Qy 136 NTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEW 195 

IMIIIIIIMIIIMMMIIMIIII II MM MMMIMIIMM IMMI 

Db 61 NTSELREAVPEPVLLSRADVRLLRLKLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEW 12 0 

Qy 196 LS FDVTG WRQWLTRREA I EGFRLSAHCS CDS KDNTLHVE I NGFNSGRRGDLAT I HGMNR 255 

MMMMIMMMI 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 

Db 121 LS FDVTGWRQWLTRRE EI EGFRLSAHCS CDS KDNTLQVD I NGFSSGRRGDLAT I HGMNR 180 

Qy 2 56 PFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTNYCFSSTEKNCCVRQL 315 

I M M II M M 1 1 M M I II 1 1 II 1 1 1 1 M 1 1 1 M M 1 1 1 1 1 

Db 181 PFLLLMATPLERAQHLHSSRHRR ALDTNYCFSSTEKNCCVRQL 223 

Qy 316 YIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVP 375 

II MIMMMIMMMMMM III IMIMM 'M MMMM. Ill I 

Db 224 YIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVP 283 

Qy 376 QALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 4 07 

1 1 II 1 1 1 1 1 M II I II M M 1 1 1 1 II 1 1 1 1 1 

Db 284 QALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 315 



RESULT 13 
US-09-756-283A-20 

; Sequence 20, Application US/09756283A 
; Patent No. US20020151478A1 
; GENERAL INFORMATION: 

APPLICANT: Cherna j ovsky , Yuti 

APPLICANT: Dreja, Hanna Stina 
; APPLICANT: Adams, Gillian 
; TITLE OF INVENTION: Latent Fusion Protein 
; FILE REFERENCE: 0623.1000000 

; CURRENT APPLICATION NUMBER: US/0 9/75 6 , 2 83A 
; CURRENT FILING DATE: 2001-01-09 
; NUMBER OF SEQ ID NOS: 100 

SOFTWARE: Patent In version 3.0 
; SEQ ID NO 20 
LENGTH: 4 55 
TYPE: PRT 

ORGANISM: Artificial Sequence 



FEATURE : 

OTHER INFORMATION: LAP-ml FN construct 
US-09-756-283A-20 



Query Match 59-1%; Score 1282.5; DB 10; Length 455; 

Best Local Similarity 91.2%; Pred. No. 4.6e-lll; 

Matches 250; Conservative 10; Mismatches 13; Indels 1; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I i I I I I i I II II I I I I I I I I M I IIIMini II I III ! I II II 
Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I I I I h I I I I I I I I I I I I I I I I II I I I I ' I I I I I I I I 'I I !l I I I I I I I I ll : hi 
Db 61 SPPSQGEVPPGPLPEAVXiALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLL-RLKLKVEQHVELYQKYSNDSW 179 

1 1 1 II : 11 = 11 1 1 1 ; 1 1 1 1 1 1 1 1 1 ' 1 1 1 III 1 1 1 !l II : I I I I I I ! I I M I M I 

Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRRLKLKVEQHVELYQKYSNNSW 18 0 

Qy 180 RYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGF 239 

1 1 1 1 ! 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h I 1 1 1 II I II I II 1 1 M M : II 

Db 181 RYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGF 24 0 

Qy 24 0 NSGRRGDLATIHGMNRPFLLLMATPLERAQHLHS 273 

: 1 1 1 1 1 1 1 II II I M II I M 1 1 1 1 1 1 1 1 1 M I 

Db 241 TTGRRGDLATIHGMNRPFLLLMATPLERAQHLQS 274 



RESULT 14 
US-09-756-283A-22 

; Sequence 22, Application US/09756283A 

; Patent No. US20020151478A1 

; GENERAL INFORMATION: 

; APPLICANT: Cherna j ovsky , Yuti 

; APPLICANT: Dreja, Hanna Stina 

; APPLICANT: Adams, Gillian 

; TITLE OF INVENTION: Latent Fusion Protein 

; FILE REFERENCE: 0623.1000000 

; CURRENT APPLICATION NUMBER: US/ 09/756 , 283A 
; CURRENT FILING DATE: 2001-01-09 
; NUMBER OF SEQ ID NOS : 100 

SOFTWARE: Patentln version 3.0 
; SEQ ID NO 22 

LENGTH: 44 7 

TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: ml FN -LAP construct 
US-09-756-283A-22 

Query Match 53.4%; Score 1159; DB 10; Length 447; 

Best Local Similarity 87.8%; Pred. No. 1.5e-99; 

Matches 230; Conservative 10; Mismatches 16; Indels 6; Gaps 3; 
Qy 14 PL - LWLLVLTPGRPAAGLSTCKTI DMELVKRKR I EAI RGQI LSKLRLAS PPSQGDVPPGP 72 

II M I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I M I , I I'l I M 1 1 i 1 1 1 1 1 M II 1 1 



Db 



190 PLGLW AGGGSAAALSTCKTIDMELVKRKRI EAIRGQILSKLRLASPPSQGEVPPGP 245 



Qy 



73 LPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI YDKFKGTPHSLY 132 



Db 




Qy 



133 MLFNTSELREAVPEPVTjLSRAELRLL-RLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSD 191 



Db 




Qy 



192 SPEWLSFDVTGWRQWLTRREAI EGFRLSAHCSCDSKDNTLHVEI NGFNSGRRGDLATI H 2 51 





Db 



3 6 6 SPEWLSFDVTGWRQWLSRGGEI EGFRLSAHCSCDSRDNTLQVDINGFTTGRRGDLATI H 42 5 



Qy 



252 GMNRPFLLLMATPLERAQHLHS 273 



Db 



426 GMNRPFLLLMATPLERAQHLQS 447 




RESULT 15 
US-09-214-592-32 

; Sequence 32, Application US/09214592A 
; Publication No. US20030027218A1 
; GENERAL INFORMATION : 

APPLICANT: Yamasaki , CMotoo 
; APPLICANT: Shiba ta , CKenj i 
; APPLICANT: Sato , CYasuf umi 

; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

; TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
; FILE REFERENCE: 11060 

; CURRENT APPLICATION NUMBER: US/09/214 , 592A 
; CURRENT FILING DATE: 1999-01-18 
; NUMBER OF SEQ ID NOS : 34 
; SOFTWARE : 
; SEQ ID NO 32 

LENGTH: 373 

TYPE : PRT 

ORGANISM: chicken 
US-09-214-592-32 

Query Match 49.2%; Score 1067; DB 11; Length 373; 

Best Local Similarity 53.5%; Pred. No. 4.6e-91; 

Matches 209; Conservative 52; Mismatches 98; Indels 32; Gaps 8; 

Qy 30 LSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVA 89 

Mlh =h| hllllhllllllllll :|| : II llh I ll'h 
Db 2 LSTCQRLDLEAAKKKRI EAVRGQILSKLRLTAPPPASETPPRPLPDDVRALYNSTQELLK 61 

Qy 90 GES- VEPEPEPEADYYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPV 148 



Db 



•■II- '1*111' I ' l II-- -II I I I 

62 QRARLRPPPDGPDEYWAKELRRI PMETTWDGAMEHWQPQSHSI FFVFNVSRARRG-GRPT 12 0 



Qy 



14 9 LLSRAELRLLRLKLK VEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGV 2 03 

M llllhll I M :|||| I I Mill | : : |||||||| 

121 LLHRAELRMLRQKAAADSAGTEQRLELYQGYGNASWRYLHGRSVRATADDEWLSFDVTDA 18 0 



Db 



Qy 204 VRQWLTRREAI EGFRLSAHCSCD SKDNTLHVE I NGFNSGRRGDLATI HGMNR - - PFL 258 

I II h I : hi I II h : : I II :||h :| =| h = 

Db 181 VHQWLSGSELLGVFKLSVHCPCEMGPGHAEEMRI S I EGFEQ- QRGDMQS I AKKHRRVPYV 239 

Qy 2 59 LLMATPLERAQHLHSSRHRRALDTNS YP YDVPDYASLALDTN YCF - - SSTEKNCCVRQLY 316 

I M I III llhl II MM : Mill; || 

Db 24 0 LAMALPAERANELHSARRRR DLDTDYCFGPGTDEKNCCVRPLY 2 82 

Qy 317 IDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQ 376 

MM I 1 1 1 , 1 1 1 1 MIMMMIMM I II M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 

Db 283 IDFRKDLQWKWIHEPKGYMANFCMGPCPYIWSADTQYTKVLALYNQHNPGASAAPCCVPQ 342 

Qy 377 ALE PLP I VYYVGRKPKVEQLSNM 1 VRS CKCS 4 07 

M-MMMIIM MIIIMhlhllll 

Db 34 3 TLDPLP I I YYVGRNVRVEQLSNM WRACKCS 373 



Search completed: October 28, 2003, 09:28:02 
Job time : 24.6934 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2 003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



October 28, 2003, 00:37:16 ; Search time 13.4491 Seconds 

(without alignments) 
2910.285 Million cell updates/sec 

US-10-017-372E-39 
2169 

1 MAPSGLRLLPLLLPLLWLLV GRKPKVEQLSNMI VRSCKCS 4 07 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 283308 seqs, 96168682 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



283308 



Database : 



PIR_76 : * 
pirl : * 

pir2 : * 
pir3 : * 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 
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ALIGNMENTS 



RESULT 1 



A27512 

transforming growth factor beta-1 precursor - pig 
N;Alternate names: TGF-beta 

C; Species: Sus scrofa domestica (domestic pig) 

C;Date: 05-Jun-1988 #sequence_revision 05-Jun-1988 #text_change 16-Jul-1999 

C; Accession: A27512; A26356; 146657 

R;Derynck, R. ; Rhee, L. 

Nucleic Acids Res. 15, 3187, 1987 

A;Title: Sequence of the porcine transforming growth factor-beta precursor. 
A;Reference number: A27512; MUID : 87174844 ; PMID-.3470708 
A /Access ion : A27512 
A /Molecule type: mRNA 
A/Residues : 1-390 <DER> 

R;Cheifetz, S.; Weatherbee, J. A.; Tsang, M.L.S.; Anderson, J.K.; Mole, J.E.; 
Lucas, R.; Massague, J. 
Cell 48, 409-415, 1987 

A;Title: The transforming growth factor-beta system, a complex pattern of cross- 
reactive ligands and receptors. 

A/Reference number: A90890; MUID : 87102890 ; PMID:2879635 
A /Access ion : A2 6356 
A/Molecule type: protein 
A/Residues: 279-322 <CHE> 

R/Kondaiah, P./ Van Obberghen- Schilling, E. ; Ludwig, R.L.; Dhar, R./ Sporn, 
M.B. ; Roberts, A.B. 

J. Biol. Chem. 263, 18313-18317, 1988 

A/ Title: cDNA cloning of porcine transforming growth factor-beta 1 mRNAs . 

Evidence for alternate splicing and polyadenylation . 

A /Reference number: 146657/ MUID : 89054010 ; PMID:2461367 

A /Access ion : 14 6657 

A/Status : preliminary/ translated from GB/EMBL/DDBJ 
A/ Molecule type: mRNA 
A /Residues: 1-390 <K0N> 

A/ Cross-references : GB:M23703/ NID:g755044 ; PIDN : AAA64616 . 1 / PID:g755045 
C; Genetics : 

A /Gene: TGFB / TGF -beta-1 
C; Superf amily : inhibin 
C/ Keywords: growth factor 



Query Match 94 . 7%; 

Best Local Similarity 95.6%/ 
Matches 389/ Conservative 



Score 2 053.5; DB 2/ 
Pred. No. 2.1e-158; 
0/ Mismatches 1; 



Length 3 90/ 
Indels 17; Gaps 



1; 



Qy 
Db 

Qy 
Db 

Qy 

Db 



1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I IIIIMIIIMIIIMIIIIMIIIMMIIIIIIIIIIIMIMIIIIIIIIIIIII 

1 MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 



61 



121 



120 



SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 

I I I I I I I I I I M Ml I III I M I I ! I I I! I M I I I I ! Ml I I I I M I I II I I I I I I I I I 

61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 



180 



YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 

1 . 1 1 II il 1 1 II II I III II < II II I M M 1 1 1 1 I ! In I > I L , I > i; 1 1 1 1 1 1 1 1 

121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 



Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHCSCDSKDNTLHVEINGFN 24 0 

1 1 1 1 1 1 1 1 1 M M II I III M 1 1 II MM M 1 1 1 1 M I Mill M 1 1 M II 1 1 M Ml II 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHCSCDSKDNTLHVEINGFN 24 0 



Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 300 

1 1 ii 1 111 1 1 1 1 1 1 1 1 1 ' i M ii 1 1 1 1 1 ii 1 1 1 ri i inn 

Db 241 SGRRGDLAT I HGMNRPFLLLMATPLERAQHLHSSRHRR ALDTN 28 3 

Qy 301 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

I I I I I M I I I I I I I I I I I 1 1 I 1 1 I 1 1 I II I! I I I I I I I II I I I I I II I ; I I I I I : I I 

Db 284 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 

Qy 361 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 407 

II III III 1 1 1 1 1 M I M I M II 1 1 1 1 Ml II 1 1 M M II 1 1 Ml 

Db 344 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 3 90 



RESULT 2 
S01413 

transforming growth factor beta-1 precursor - chicken 
C;Species: Gallus gallus (chicken) 

C;Date: 30-Jun-1989 #sequence_revision 30-Jun-1989 #text_change 24-Nov-1999 
C; Access ion: SO 14 13 

R;Jakowlew, S.B.; Dillard, P.J.; Sporn, M.B.; Roberts, A . B . 
Nucleic Acids Res. 16, 8730, 1988 

A/Title : Nucleotide sequence of chicken transforming growth factor-beta 1 (TGF- 
beta 1) . 

A;Reference number: S01413; MUID : 88335639 ; PMID:3166520 
A, -Access ion: SO 14 13 
A /Molecule type: DNA 
A;Residues: 1-391 <JAK> 

A; Cross-references : EMBL:X12373; NID:g63808; PIDN : CAA3 0933 . 1 ; PID:g63809 
C; Super family: inhibin 
C;Keywords: growth factor 

Query Match 93.2%; Score 2022; DB 2; Length 391; 

Best Local Similarity 94.6%; Pred . No. 7.6e-156; 

Matches 386; Conservative 0; Mismatches 4; Indels 18; Gaps 2; 
Qy 1 MAPSGLRLLPLLLPLLWLL VLTPGRPAAGLSTCKT I DMELVKRKRI EAI RGQ I LSKLRLA 60 

I Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1 MP PSGPGLLPLLLPLLWLLVLT PGR PAAGLSTCKT I DMELVKRKR I EA I RGQ I LS KLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

Mill I MM llllllll INI llllll lllllll IIIIIMIIMI llllllllll III 

Db 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II ! 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 

Db 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWG 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEIN-GF 239 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 II 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHCSCDSKDNTLHVEINAGF 240 

Qy 24 0 NSGRRGDLAT I HGMNRPFLLLMATPLERAQHLHS SRHRRALDTNSYPYDVPDYASLALDT 2 99 

1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM 

Db 241 NSGRRGDLATI HGMNRPFLLLMATPLERAQHLHSSRHRR ALDT 283 



QY 



3 00 NYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLAL 359 



M I II M II M 1 1 III II MM II II M M MM II M M MM I II II II 1 1 II II II I 

Db 284 NYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLAL 343 

Qy 360 YNQHN PGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 407 

h I I I : I I I I I I I I II II I II I I I I I , I I Ml I I I I I I ll! I I II I 
Db 344 YNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 391 

RESULT 3 
146463 

transforming growth factor beta-1 - sheep 

C;Species: Ovis orientalis aries, Ovis ammon aries (domestic sheep) 

C;Date: 19-Dec-1997 #sequence_revision 19-Dec-1997 #text_change 24-Nov-1999 

C;Accession: 146463; S45115 

R;Woodall, C.J.; McLaren, L.J.; Watt, N.J. 

Gene 150, 371-373, 1994 

A;Title: Sequence and chromosomal localisation of the gene encoding ovine latent 
transforming growth factor-beta 1. 

A; Reference number: 146463; MUID : 95121932 ; PMID:7821809 
A; Access ion : 14 64 63 

A/Status : preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A;Residues: 1-390 <WOO> 

A; Cross -references : EMBL:X76916; NID:g4 9664 8 ; PIDN : CAA54242 . 1 ; PID:g4 9664 9 
A;Note: submitted to the EMBL Data Library, December 1993 
C;Superf amily : inhibin 

Query Match 89.9%; Score 1949.5; DB 2; Length 390; 

Best Local Similarity 90.2%; Pred. No. 5.6e-150; 

Matches 367; Conservative 10; Mismatches 13; Indels 17; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTl DMELVKRKRI EAI RGQI LSKLRLA 60 

I M M I II M M M III MIMM I MMMMMMMIM MMMMMMMI 

Db 1 MPPSGLRLLPLLLPLLWLLMLTPGRPVAGLSTCKTI DMELVKRKGI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

MMMMMMMMMMMMMMMM I MMMMMMMMMM MM 

Db 61 SPPSQGDVPPGPLPEAILALYNSTRDRVAGESAETEPEPEADYYAKEVTRVLMVEYGNKI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Ml I : MMI Mill Mill II II I II IM M I Ml MM MM I II 1 1 II M Ml 

Db 121 YDKMKSSSHSI YMFFNTSELREAVPEPVLLSRADVRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHCSCDSKDNTLHVEINGFN 24 0 

M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii j 1 1 1 r 1 1 1 r 1 1 1 1 1 1 1 1 f i MiiiM 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTHREEIEGFRLSAHCSCDSKDNTLQVDINGFS 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 3 00 

Ml M II M IM M Mill MM II I II II I III II M MMI 

Db 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRR ALDTN 283 

Qy 3 01 YCFSSTEKNCCVRQLYI DFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 36 0 

M II M M IM II M M M MIMI 1 1 II I M I M M IM MM II IM M Ml M III 

Db 2 84 YCFSSTEKNCCVRQLYIDFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 34 3 

Qy 3 61 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 4 07 

II I M . M I M III II M MM M M 1 1 II III M i MIIIM II 



Db 344 NQHNPGASAAPCCVPQALE PLP I VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 4 
WFHU2 

transforming growth factor beta-1 precursor [validated] - human 
N; Alternate names: growth-inhibitory factor; TGF type 2; TGF-beta 
C; Species: Homo sapiens (man) 

C;Date: 28-Feb-1986 #sequence__revis ion 19-Oct-1995 #text_change 08-Dec-2000 
C;Accession: A27513; A01395; A22290; 159664; S53444 
R;Derynck, R . ; Rhee, L. ; Chen, E.Y.; Van Tilburg, A. 
Nucleic Acids Res. 15, 3188-3189, 1987 

A;Title: Intron-exon structure of the human transforming growth factor-beta 
precursor gene. 

A;Reference number: A27513; MUID : 87174845 ; PMID : 3470709 
A; Accession : A27 513 
A; Molecule type: DNA 
A;Residues: 1-390 <DER> 

A; Cross-references : GB:X05839; GB:Y00112; NID:g37097; PIDN:CAA29283 . 1; 
PID:gl212989 

R;Derynck, R. ; Jarrett, J. A.; Chen, E.Y.; Eaton, D.H.; Bell, J.R.; Assoian, 
R.K.; Roberts, A.B.; Sporn, M.B.; Goeddel , D.V. 
Nature 316, 701-705, 1985 

A; Title: Human transforming growth factor-beta complementary DNA sequence and 
expression in normal and transformed cells. 
A;Reference number: A01395; MUID : 85296301 ; PMID:3861940 
A; Access ion: AO 13 95 
A; Molecule type: mRNA 

A; Residues: 1-9, 'P* ,11-24, 'P',26-159,'R' ,160-3 9 0 <DE2> 

A; Cross-references : GB:X02812; GB:J05114; NID:g37092; PIDN : CAA26580 . 1 ; 
PID:g37093 

A;Note: the authors suggest that residues 8-23 could represent the hydrophobic 

core of an amino- terminal signal peptide 

R;Massague, J.; Like, B. 

J. Biol. Chem. 260, 2636-2645, 1985 

A;Title: Cellular receptors for type beta transforming growth factor. Ligand 

binding and affinity labeling in human and rodent cell lines. 

A; Reference number: A22290; MUID : 8513 1019 ; PMID:2982829 

A; Access ion: A222 90 

A;Molecule type: protein 

A;Residues: 279-295 , 'XX 298 -301 <MAS> 

R;Urushizaki , Y. ; Niitsu, Y.; Terui , T.; Koshida, Y.; Mahara, K. ; Kohgo, Y.; 
Urushizaki, I.; Takahashi, Y. ; I to, H. 
Tumor Res. 22, 41-55, 1987 

A; Title: Cloning and expression of the gene for human transforming growth 
factor-beta in Escherichia coli. 
A;Reference number: 159664 
A; Access ion : 159664 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A;Residues: 279-390 <RES> 

A; Cross-references : GB:M38449; NID:g339557; PIDN : AAA36735 . 1 ; PID:g339558 
R;Stam, K. ; Stewart, A. A. ; Qu, G.Y.; Iwata, K.K.; Fenyoe, D. ; Chait, B.T.; 
Marshak, D.R.; Haley, J.D. 
Biochem. J. 305, 87-92, 1995 

A;Title: Physical and biological characterization of a growth- inhibitory 
activity purified from the neuroepithelioma cell line A673 . 



A;Reference number: S53444; MUID : 95126934 ; PMID:7826358 
A /Accession : S534 44 
A; Status: preliminary 
A;Molecule type: protein 
A;Residues: 279-297 <STA> 

C; Comment: The mature protein is the carboxyl -terminal segment of a precursor 
polypeptide; the active molecule is a dimer of identical polypeptide chains 
linked by an interchain disulfide bond. 
C; Genetics : 

A; Gene: GDB : TGFB1 ; TGFB 

A; Cross -references : GDB: 120729; OMIM: 190180 
A ; Map position: 19ql3 .2-19ql3 . 2 
C; Super family : inhibin 

C; Keywords: glycoprotein; growth factor; homodimer; mitogen; transformation 
F; 1- 18 /Domain : signal sequence #status predicted <SIG> 
F; 19-278/ Doma in : propeptide #status predicted <PRO> 
F;244-246/Region: cell attachment (R-G-D) motif 

F; 279-390/ Product : transforming growth factor beta-1 #status experimental <MAT> 
F; 82 , 136, 176/Binding site: carbohydrate (Asn) (covalent) ttstatus predicted 

Query Match 89.3%; Score 1936.5; DB 1; Length 390; 

Best Local Similarity 90.2%; Pred . No. 6.3e-149; 

Matches 367; Conservative 10; Mismatches 13; Indels 17; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I IIIMII I I I I 'I I I M M I I II I II , I I I I I, I i I III I L'l I M I I I I I I II I I! 
Db 1 MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKR I EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I I , II I H I I I I II; I I I I I I I I U I ' II I I I lllllllllllllllllllllh hi 
Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill = II ' I I I II 'II I II , II, II I II I I II II 'II, I I I III M'll I I h III 
Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHCSCDSKDNTLHVEINGFN 24 0 

I I M I : II ; I I I IN I I I II I I I I III I-" I h I h I II I hi hi I I I I : I I I I 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEI EGFRLSAHCSCDSRDNTLQVDINGFT 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 3 00 

•• 1 1 1 1 1 1 1 1 1 1 h 1 1 1 1 h 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 Mill 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTN 283 

Qy 3 01 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 3 60 

I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 284 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 

Qy 361 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 4 07 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 344 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 5 
A26960 

transforming growth factor beta-1 precursor - green monkey 
C; Species: Cercopithecus aethiops {green monkey, grivet) 



C;Date: 05-Oct-1988 #sequence_revi s ion 05 -Oct - 1 988 #text_change 24-Nov-1999 
C; Access ion: A2 6960 

R;Sharples, K. ; Plowman, G.D.; Rose, T.M. ; Twardzik, D.R.; Purchio, A.F. 
DNA 6, 239-244, 1987 

A; Title: Cloning and sequence analysis of simian transforming growth factor-beta 
cDNA . 

A;Reference number: A26960; MUID : 87246074 ; PMID:3474130 
A; Access ion: A2 6 96 0 
A; Molecule type: mRNA 
A; Residues -. 1-390 <SHA> 

A; Cross-references : GB:M16658; NID:gl76552; PIDN : AAA35369 . 1 ; PID:gl76553 
C; Superf amily : inhibin 
C; Keywords: growth factor 

F; 1-16/Domain: signal sequence #status predicted <SIG> 

F; 17 -3 90/ Product : transforming growth factor beta tfstatus predicted <MAT> 

Query Match 8 9.2%; Score 1934.5; DB 2; Length 390; 

Best Local Similarity 89.9%; Pred. No. 9.1e-149; 

Matches 366; Conservative 10; Mismatches 14; Indels 17; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I M II 1 1 II M M I II 1 1 II II IMMMIMMIIMIMIMI MNIMIMM 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPSRPAAGLSTCKTI DMELVKRKRI ETI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

' i II I M M 1 1 1 1 III 1 11 1 lr I: I M 1 1 1 1 '. 1 1 1, 1 1 1 1 1 1 1 ! 1 1 II I: hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : Ihll Ml I II I ll I II I ll I ,1 I I I I M I I !l I I I I I I I I I Mil III 
Db 121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDS PEWLSFDVTG WRQWLTRREA I EGFRLSAHCSCDSKDNTLHVE I NGFN 240 

MIIIIIIIMIIIIIIIMIIIIIIIMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MINI 

Db 181 YLSNRLLAPSNSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSKDNTLQVDINGFT 24 0 

Qy 241 SGRRGDLATI HGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 3 00 

= 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 ; I I 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTN 283 

Qy 3 01 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 3 60 

I MM M I M 1 1 1 MM 1 1 II II Ml II II Ml II M M MIMM Ml I II II I II II I 

Db 284 YCFSSTEKNCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 

Qy 361 NQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 407 

II I MM MMM MMI II II 1 1 M M M I; MM MM M M M 

Db 344 NQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNM I VRSCKCS 390 



RESULT 6 
JC4 023 

transforming growth factor beta-1 - dog 
C;Species: Cams lupus familiaris (dog) 

C;Date: 13-Jun-1995 #sequence_revision 14-Jul-1995 #text_change 24-Nov-1999 
C; Access ion: JC4 023 

R; Manning, A.M. ; Auchampach, J. A.; Drong, R.F.; Slightom, J.L. 
Gene 155, 307-308, 1995 



A/Title : Cloning of a canine cDNA homologous to the human transforming growth 
factor-beta 1 -encoding gene. 

A; Reference number: JC4023; MUID: 95237630 ; PMID:7721110 
A; Accession : JC4023 
A;Molecule type: mRNA 
A;Residues: 1-390 <MAN> 

A; Cross -references : GB:L34956; NID:g516071; PIDN : AAA51458 . 1 ; PID:g516072 

C; Comment: This factor plays a multifunctional role as a regulator of mammalian 

cell growth and as a modulator of immune responses. 

C; Genetics : 

A;Gene: tgf-betal 

C; Super family: inhibin 

C; Keywords: growth factor; transforming protein 

F;288-390/Product : transforming growth factor beta 1 #status predicted <MAT> 

Query Match 88.2%; Score 1913.5; DB 2; Length 390; 

Best Local Similarity 89.2%; Pred. No. 4.6e-147; 

Matches 363; Conservative 12; Mismatches 15; Indels 17; Gaps 1 ; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKR I EAI RGQI LSKLRLA 60 

I MILIUM 1 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 1 Ml ! I Ml < III 1 1 1 1'l I h 

Db 1 MP PSGLRLLPLLL PLLRLLVLTPGRPAAGLSTCKT I DMELVKRKR I EA I RGQ I LS KLRLS 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

MIMIMM MMMMIMMIMI Ml II I M I MM I III I IM II I M M 

Db 61 SPPSQGEVPPVPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVENTNKI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Ml I M MM I II I Ml II 1 1 M 1 1 IM I M 1 1 M 1 1 Ml I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 121 YEKVKKSPHSI YMLFNTSELREAVPEPVLLSRAELRLLRLKLKAEQHVELYQKYSNDSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 24 0 

Ml M 1 1 1 M M 1 1 MMM II I Ml 1 1 M M M 1 1 M 1 1 1 M 1 1 1 1 II 1 = 1111- 

Db 181 YLSNRLLAPSDTPEWLSFDVTGWRQWLSHGGEVEGFRLSAHCSCDSKDNTLQVDINGFS 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 3 00 

I I I I I I I I I II I I I I I I I 1 I I I I I I I I I I I I I I I I I MMI 
Db 241 SSRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRQRR ALDTN 2 83 

Qy 3 01 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 3 60 

I M 1 1 II I II I II 1 1 1 II II II I II I M M 1 1 1 1 1 1 1 1 MM II Ml I III I II I M I II 

Db 2 84 YCFSSTEKNCCWQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVLALY 34 3 

Qy 3 61 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 4 07 

II 1 1 III I M 1 1 1 1 1 1 MM II I M II IM IM 1 1 M I Mill I! M 

Db 344 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNM I VRSCKCS 390 



RESULT 7 
WFMS2 

transforming growth factor beta-1 precursor - mouse 
N;Alternate names: TGF type 2; TGF-beta 
C; Species: Mus musculus (house mouse) 

C;Date: 04-Dec-1986 #sequence__revision 04-Dec-1986 #text_change 24-Nov-1999 
C;Accession: A01396 

R;Derynck, R. ; Jarrett, J.A.; Chen, E.Y.; Goeddel , D.V. 
J. Biol. Chem. 261, 4377-4379, 1986 



A; Title: The murine transforming growth factor-beta precursor. 
A;Reference number: A01396; MUID : 86168 129 ; PMID:3007454 
A; Accession : AO 13 96 
A; Molecule type: mRNA 
A;Residues: 1-390 <DER> 

A; Cross-references: GB:M13177; NID:g201952; PIDN : AAA40423 . 1 ; PID:g201953 
A;Note: the authors suggest that residues 8-23 could represent the hydrophobic 
core of an amino- terminal signal peptide 

C; Comment: The mature protein is the carboxyl -terminal segment of a precursor 
polypeptide; the active molecule is a dimer of identical polypeptide chains 
linked by an interchain disulfide bond. 
C; Superf amily : inhibin 

C; Keywords: glycoprotein; growth factor; growth regulation; homodimer; mitogen; 
transformation 

F ; 1 -23 /Domain : signal sequence ^status predicted <SIG> 
F;24-278/Domain: propeptide tfstatus predicted <PRO> 
F;244-246/Region: cell attachment (R-G-D) motif 

F; 279-390/Product : transforming growth factor beta-1 #status predicted <MAT> 
F; 82 , 136, 176/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 85.5%; Score 1855.5; DB 1; Length 390; 

Best Local Similarity 85.5%; Pred. No. 2.3e-142; 

Matches 348; Conservative 15; Mismatches 27; Indels 17; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRI EAIRGQILSKLRLA 60 

I lllllllllllll 1 1 1 1 1 1 1 II I ! : I I I ! ; , i I , I I I ' I I Mi I , I I 

Db 1 MP PSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKTIDMELVKRKRI EAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I M M h 1 1 1 1 1 1 1 1 i III : 1 1 1 1 1 1 ■ I Ml I : 1 1 1 1 II II 1 1 1 1 Ml 1 1 1 1 ' : II 

Db 61 SPPSQGEVPPGPLPE^VTALYNSTRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

hi I II M I MlhMIIIIII MIIIIMI III lUMIMMUhMI 

Db 121 YEKTKDI SHS I YMFFNTSDI REAVPEPPLLSRAELRLQRLKSSVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAI EGFRLSAH CS CDS KDNTLHVE I NGFN 24 0 

II 1 1 1 1 hMMIIIIMMIIIIII : : hill M 1 1 1 1 M 1 1 1 Mil 

Db 181 YLGNRLLTPTDTP EWLS FDVTG WRQWLNQGDG I QGFRFS AHCS CDS KDNKLHVE I NG I S 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTO 300 

Mill Ml M I MM II 1 1 1 II II II III M II 1 1 1 r I 

Db 241 PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRR ALDTO 283 

Qy 301 YCFSSTEKNCOmQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

i r 1 1 r i j i r 1 1 r 1 1 1 1 1 1 1 1 1 1 f r r 1 1 1 r 1 1 1 r j i f 1 1 1 1 1 1 1 1 1 1 1 r 1 1 r r 1 1 1 1 f 1 1 1 

Db 284 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 

Qy 361 NQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 4 07 

1 1 1 1 1 1 1 1 1 = 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 r r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 344 NQHNPGASASPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 8 
S10219 

transforming growth factor beta-1 precursor - rat 
N; Alternate names: TGF type 2; TGF-beta 



C; Species: Rattus norvegicus (Norway rat) 

C;Date: 12-Feb-1993 #sequence_revision 12-Feb-1993 #text_change 24-Nov-1999 
C;Accession: S10219; PT0023; S02267 

R;Qian, S.W.; Kondaiah, P.; Roberts, A.B.; Sporn, M.B. 
Nucleic Acids Res. 18, 3059, 1990 

A/Title: cDNA cloning by PCR of rat transforming growth factor beta-1. 
A /Reference number: S10219; MUID : 90272425 ; PMID:2349108 
A /Access ion: SI 02 19 
A;Molecule type: mRNA 
A;Residues: 1-390 <QIA> 

A; Cross-references : EMBL:X52498; NID:g57341; PIDN : CAA3674 1 . 1 ; PID:g57342 
R;Okada, F . ; Yamaguchi, K. ; Ichihara, A.; Nakamura , T. 
J. Biochem. 106, 304-310, 1989 

A;Title: Purification and structural analysis of a latent form of transforming 
growth factor-beta from rat platelets. 

A/Reference number: PT0023; MUID : 90036779 ; PMID:2478527 
A; Accession : PT0023 
A;Molecule type: protein 

A;Residues: 30-32, 'X' ,34-38, ! Q' ,40-42, 'X' ,44 <OKA> 
R;Okada, F.; Yamaguchi, K. ; Ichihara, A.; Nakamura, T. 
FEBS Lett. 242, 240-244, 1989 

A;Title: One of two subunits of masking protein in latent TGF-beta is a part of 
pro-TGF-beta . 

A;Reference number: S02267; MUID : 8 912 1 078 ; PMID:2914605 
A; Access ion : S 02 267 
A;Molecule type: protein 

A;Residues: 30-32 , 1 X ' , 34 -38 , 1 Q 1 , 40 -42 , « X ■ , 44 <OK2> 
C; Superf amily : inhibin 

C; Keywords: glycoprotein; growth factor; integrin binding 
F; 1-29/Domain: signal sequence #status predicted <SIG> 
F;30-278/Domain: propeptide #status experimental <PR0> 
F;244-246/Region: cell attachment (R-G-D) motif 

F ; 279 -3 90/ Product : transforming growth factor beta-1 #status predicted <MAT> 
F; 82 , 136, 176/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 85.5%; Score 1855.5; DB 2; Length 3 90; 

Best Local Similarity 85.5%; Pred . No. 2.3e-142; 

Matches 34 8; Conservative 14; Mismatches 28; Indels 17; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I lllllllllllll I II M I I I I I I I M I I I M I II MM I II Ml I M I I I I I I I I 
Db 1 MPPSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I I I I I I : I I I I I I I : I : I I 1 i I , I I I I ' 1 I MIIIIIIMIIIIIIIIIIh II 
Db 61 S P PS QGE VP PG P L P EA VLAL YNSTRDR VAGE SAD P E P E P EAD Y YAKE VTR VLM VDRNNA I 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

III I Ihll I llh: II MM I lllllllll I I I III II MM II hi II 
Db 121 YDKTKDI THS I YMFFNTSDI REAVPEPPLLSRAELRLQRFKSTVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHCSCDSKDNTLHVEINGFN 240 

M MM h h I M II II II M M II I : hill MIMIIMM lllllll : 

Db 181 YLGNRLLTPTDTPEWLSFDVTGWRQWLNQGDG I QGFRFSAHCS CDS KDN VLHVE I NG I S 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 3 00 

Mill III III lllllllllllll I MM Mill Mill 



Db 241 PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRR 



•ALDTN 2 83 



Qy 3 01 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

1 1 : 1 1 1 II 1 1 1 1 1 1 M 1 1 1 ; II 1 1 1 1 1 1 1 1 1 1 H 1 1 1 1 . 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

Db 284 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 

Qy 3 61 NQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 4 07 

1 1 II 1 1 1 h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 M I II I 1 1 M . 

Db 344 NQHNPGASASPCCVPQALEPLP I VYYVGRKPKVEQLSNM I VRSCKCS 3 90 



RESULT 9 
A40057 

transforming growth factor beta-1 precursor - bovine (fragment) 

N;Alternate names: beta-TGF; cartilage-inducing factor-A; EGF -dependent TGF or 

dEGF; MGF-b; milk growth factor b; TGF- type II 

C;Species: Bos primigenius taurus (cattle) 

C;Date: 28-Feb-1992 #sequence_revision 28-Feb-1992 #text__change 16-Jul-1999 
C;Accession: A40057; A42320; A05284; A24322; B61439 

R; Van Obberghen -Schilling, E.; Kondaiah, P.; Ludwig, R.L.; Sporn, M.B.; Baker, 
C.C. 

Mol. Endocrinol. 1, 693-698, 1987 

A; Title: Complementary deoxyribonucleic acid cloning of bovine transforming 
growth f actor-betal . 

A;Reference number: A40057; MUID : 91042552 ; PMID:3153459 
A; Accession : A40057 
A; Molecule type: mRNA 
A/Residues : 1-315 <VAN> 

A/Cross-references : GB:M36271; NID:gl63747; PIDN : AAA30778 . 1 ; PID:gl63748 
R;Ogawa, Y. ; Schmidt, D.K.; Dasch, J . R . ; Chang, R.J.; Glaser, C.B. 
J. Biol. Chem. 267, 2325-2328, 1992 

A;Title: Purification and characterization of transforming growth f actor-beta2 . 3 

and -betal.2 heterodimers from bovine bone. 

A;Reference number: A42320; MUID: 92129307 ; PMID: 1733936 

A;Accession: A42320 

A;Molecule type: protein 

A;Residues: 204 -2 09 , ' X ■ , 2 11 -2 17 <OGA> 

R;Roberts, A.B.; Anzano, M . A . ; Meyers, C.A. ; Wideman, J.; Blacher, R.; Pan, 
Y.C.E.; Stein, S.; Lehrman, S.R.; Smith, J.M.; Lamb, L.C.; Sporn, M.B. 
Biochemistry 22, 5692-5698, 1983 

A;Title: Purification and properties of a type beta transforming growth factor 
from bovine kidney. 

A;Reference number: A05284; MUID : 84 104793 ; PMID:6607069 
A /Access ion : AO 52 84 
A;Molecule type: protein 
A/Residues : 204-218 <ROB> 

R;Seyedin, S.M.; Thompson, A.Y.; Bentz, H. ; Rosen, D.M.; McPherson, J.M.; Conti, 
A. ; Siegel, N.R.; Galluppi, G.R.; Piez, K.A. 
J. Biol. Chem. 261, 5693-5695, 1986 

A;Title: Cart i lage - indue ing factor-A. Apparent identity to transforming growth 
factor-beta . 

A;Reference number: A24322; MUID : 86 1 95954 ; PMID:3754555 
A; Accession : A243 22 
A;Molecule type: protein 
A;Residues: 204-233 <SEY> 

R;Jin, Y. ; Cox, D.A.; Knecht , R . ; Raschdorf, F. ; Cerletti, N. 
J. Protein Chem. 10, 565-575, 1991 



A;Title: Separation, purification, and sequence identification of TGF-betal and 
TGF-beta2 from bovine milk. 

A; Reference number: A61439; MUID: 92189724 ; PMID:1799413 
A; Accession: B614 39 
A;Molecule type: protein 

A; Residues: 2 04-2 09, *X' ,211-217, 'XX' ,22 0-232 <JIN> 

C;Comment: This polypeptide is composed of two polypeptide chains cross-linked 
by disulfide bonds. It has been found in neoplastic and non-neoplastic tissues. 
C; Comment: Type II TGF does not bind to the EGF receptor and lacks intrinsic 
mitogenic activity, but in soft agar, it reacts synergist ically with either type 
I TFG or EGF, and induces cell proliferation. Cells grown in monolayer do not 
respond in a similar manner to these growth factors, but morphologically do 
acquire a transformed phenotype. 
C; Superf amily : inhibin 

C; Keywords: glycoprotein; growth factor; heterodimer 

F;204-315/Product : transforming growth factor beta-1 #status experimental <MAT> 
F;7,61, 101/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 73.4%; Score 1592.5; DB 2; Length 315; 

Best Local Similarity 89.5%; Pred. No. 3.2e-121; 

Matches 297; Conservative 9; Mismatches 9; Indels 17; Gaps 1; 
Qy 76 AvTiALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLF 13 5 

= 1 1 1 1 1 1 r 1 1 1 1 r 1 1 1 i iiiiiiiiiiiiiiiiiiii ihiin i : ihii i 

Db 1 AI LALYNSTRDRVAGESAETEPEPEADYYAKEVTRVLMVEYGNKI YDKMKSSSHS I YMFF 60 

Qy 136 NTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEW 195 

I II 1 1 1 II II 1 1 1 1 M II : U 1 1 1 M M II II M M 1 1 M h I M 1 1 1 II M II I II M I 

Db 61 NTSELREAVPEPVLLSRADVRLLRLKLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEW 12 0 

Qy 196 LSFDVTGWRQWLTRREAI EGFRLSAHCSCDSKDNTLHVEI NGFNSGRRGDLATI HGMNR 255 

Illllllllllllllll lllllllllllllllllll hi M hi III I M II I !l 1 1 1 

Db 121 LSFDVTGWRQWLTRREEIEGFRLSAHCSCDSKDNTLQVDINGFSSGRRGDLATIHGMNR 180 

Qy 256 PFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTNYCFSSTEKNCCVRQL 315 

I I I I I II h I I h h ' I I I I I llllllllllllllllllll 
Db 181 PFLLLMATPLERAQHLHSSRHRR ALDTNYCFSSTEKNCCVRQL 223 

Qy 316 YIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVP 375 

< 1 1 ii i ii 1 1 1 1 1: hi 1 1 1 1 h i ii 1 1 1 hi 1 1 h 1 1 1 1 1 1 1 ih ii ii i ii hi i ii 

Db 224 YIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVP 283 

Qy 376 QALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 407 

I [ 1 1 1 1 1 1 1 1 1 1 ! ; I ; 1 1 1 [ 1 1 i ! . 1 1 1 j 1 1 i 

Db 284 QALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 315 



RESULT 10 
A41918 

transforming growth factor beta-4 precursor - chicken (fragment) 
N;Alternate names: TGF-beta 4 
C;Species: Gallus gallus (chicken) 

C;Date: 31-Dec-1993 #sequence_revision 31-Dec-1993 #text_change 16-Jul-1999 

C;Accession: A41918; A34941; S03110 

R ; Burt , D . W . ; Jakowl ew , S . B . 

Mol . Endocrinol. 6, 989-992, 1992 



A; Title: Correction: a new interpretation of a chicken transforming growth 
factor-beta 4 complementary DNA. 

A; Reference number: A41918; MUID : 9235703 9 ; PMID:1353860 
A; Accession: A41918 
A; Molecule type: mRNA 
A/Residues : 1-373 <BUR> 

A; Cross-references: GB:M31160; GB:X08012; GB:S41706; NID : gl262437 ; 
PIDN-.AAB05637 . 1; PID:gl262438 

A;Note: sequence extracted from NCBI backbone (NCBIN: 110186 , NCBIP : 110187) 
A;Note: this report corrects and reinterprets the sequence from reference A34941 
R;Jakowlew, S.B.; Dillard, P.J.; Sporn, M.B.; Roberts, A.B. 
Mol. Endocrinol. 2, 1186-1195, 1988 

A; Title: Complementary deoxyribonucleic acid cloning of a messenger ribonucleic 
acid encoding transforming growth factor beta-4 from chicken embryo 
chondrocytes . 

A;Reference number: A34941; MUID : 89112198 ; PMID : 2464131 
A /Accession : A3 4 941 
A; Molecule type: mRNA 

A; Residues : ' MDPMSIGPKSCGGSPWRPPGTAPWSIGSRRATASSSCSTSSRVRAEVGGRAL ' , 122- 

209, 1 D f ,211-373 <JAK> 

A; Cross-references : EMBL:X08012 

A;Note: this sequence has been corrected in A41918 
C; Superf amily : inhibin 

C; Keywords: glycoprotein; growth factor 

F; 1/Domain: signal sequence (fragment) #status predicted <SIG> 
F;223-225/Region: cell attachment (R-G-D) motif 

F; 2 60 -3 73 /Product : transforming growth factor beta-4 #status predicted <MAT> 
F; 54 , 109 , 153/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 49.2%; Score 1067; DB 2; Length 373; 

Best Local Similarity 53.5%; Pred. No. 1.4e-78; 

Matches 209; Conservative 52; Mismatches 98; Indels 32; Gaps 8; 

Qy 30 LSTCKTI DMELVKRKRI EAI RGQI LSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVA 89 

1111= =hl hIMIhllllllMII :|| : II llh I 111111- 
Db 2 LSTCQRLDLEAAKKKR I EAVRGQ I LS KLRLTAP P PAS ET PPRPLPDDVRAL YNSTQELLK 61 

Qy 9 0 GES-VEPEPEPEADYYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPV 148 

: : | |: :|:|||: |: | : : : :: ||:: :|| | | | 
Db 62 QRARLRPPPDGPDEYWAKELRRI PMETTWDGAMEHWQPQSHSI FFVFNVSRARRG-GRPT 120 

Qy 14 9 LLSRAELRLLRLKLK VEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGV 2 03 

II llllhll I II Ml | I | M I I I : : llllllll 

Db 121 LLHRAELRMLRQKAAADSAGTEQRLELYQGYGNASWRYLHGRSVRATADDEWLSFDVTDA 18 0 

Qy 204 VRQWLTRREAI EGFRLSAHCSCD SKDNTLHVEINGFNSGRRGDLATI HGMNR - - PFL 258 

I llh I : hll llh : I II :|lh =1 =| I- 

Db 181 VHQWLSGSELLGVFKLSVHCPCEMGPGHAEEMRISIEGFEQ-QRGDMOSIAKKHRRVPYV 239 

Qy 259 LLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTNYCF- -SSTEKNCCVRQLY 316 

I M I Ml llhl II Ilhlll : lllllll II 

Db 24 0 LAMALPAERANELHSARRRR DLDTDYCFGPGTDEKNCCVRPLY 282 

Qy 317 IDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQ 376 

lllllll i,l '"I lllhlllMIII 1 1 hi I IN !l 1, 1 1 1 1 1 1 1 1 1 M II 

Db 283 IDFRKDLQWKWIHEPKGYMANFCMGPCPYIWSADTQYTKVLALYNQHNPGASAAPCCVPQ 342 



Qy 377 ALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 407 

hlllhlllll : I I I M I I: I Ml II 
Db 34 3 TLDPLPI I YYVGRNVRVEQLSNMWRACKCS 3 73 



RESULT 11 
A34939 

transforming growth factor beta-3 precursor - chicken 
C; Species: Gallus gallus (chicken) 

C;Date: 13-Jul-1990 #sequence_revision 13-Jul-1990 #text_change 16-Jul-1999 
C;Accession: A34939; S25850; S36125; S36124; 151181 

R/Jakowlew, S.B.; Dillard, P.J.; Kondaiah, P.; Sporn, M.B.; Roberts, A.B. 
Mol. Endocrinol. 2, 141-755, 1988 

A; Title: Complementary deoxyribonucleic acid cloning of a novel transforming 
growth factor-beta messenger ribonucleic acid from chick embryo chondrocytes. 
A;Reference number: A34939; MUID : 8 9096966 ; PMID : 3211158 
A; Accession : A34 93 9 

A; Status : preliminary; not compared with conceptual translation 
A /Molecule type: mRNA 
A;Residues: 1-412 <JAK> 

A; Cross-references : GB:M31154; NID:g212758; PIDN : AAA4908 9 . 1 ; PID:g212759 
R;Burt, D.W.; Paton, I.R.; Dey, B , R . 
J. Mol. Endocrinol. 1, 175-183, 1991 

A;Title: Comparative analysis of human and chicken transforming growth factor- 
beta-2 and -beta-3 promoters. 

A; Reference number: S25850; MUID : 92 1344 96 ; PMID:1840616 
A /Access ion : S2585 0 
A; Status : prel iminary 
A /Molecule type: DNA 
A/Residues: 1-117 <BUR> 

A/ Cross-references : EMBL: X58127 ; NID:g63815; PIDN : CAA4 1128 . 1 / PID:g63816 
A/ Access ion: S3 6 12 5 

A; Status: preliminary/ nucleic acid sequence not shown; translation not shown 
A/ Molecule type: DNA 
A ; Residues: 119-172 <BU2> 

A; Cross-references: EMBL: X60055 ; NID:g396688; PIDN : CAA42 653 . 1 ; PID:g396689 
A;Note: the nucleotide sequence was submitted to the EMBL Data Library, June 
1991 

A; Accession: S3 6 124 

A; Status: preliminary; nucleic acid sequence not shown; translation not shown 
A /Molecule type: DNA 

A;Residues: 173 -322 ELPT ', 327-412 <BU3> 
A; Cross-references : EMBL: X60091 

A;Note: the nucleotide sequence was submitted to the EMBL Data Library, June 
1991 

R;Jakowlew, S.B.; Lechleider, R. ; Geiser, A.G.; Kim, S.J./ Santa-Coloma , T.A. / 
Cubert, J./ Sporn, M.B.; Roberts, A.B. 
Mol. Endocrinol. 6, 1285-1298, 1992 

A/Title: Identification and characterization of the chicken transforming growth 
factor-beta 3 promoter. 

A/ Reference number: 151181/ MUID : 93 024487 / PMID:1406706 
A /Access ion : 151181 
A/ Status : prel iminary 
A,- Molecule type: DNA 
A/Residues: 1-117 <JA2> 

A/ Cross-references : GB:S46000; NID:g257172/ PIDN : AAB23575 . 1 / PID:g257173 
C;Genetics : 



A;IntronS: 216/1; 252/1; 309/2; 360/3 
A;Note: list of introns may be incomplete 
C; Super family: inhibin 

C; Keywords: glycoprotein; growth factor; growth regulation; homodimer; mitogen 
F ; 1-22/ Doma in : signal sequence #status predicted <SIG> 
F;23-300/Domain: propeptide #status predicted <PRO> 

F;301-412/Product : transforming growth factor beta-3 #status predicted <MAT> 
F;74 , 142/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 41.0%; Score 888.5; DB 2; Length 412; 

Best Local Similarity 46.2%; Pred . No. 4.4e-64; 

Matches 198; Conservative 56; Mismatches 114; Indels 61; Gaps 13; 
Qy 15 LLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLASPPSQGDVPPGPLP 74 

h I =h = MM hhl MMIMMMMIIMM Ml I I 

Db 9 LVLLSLLSFATVSLALSSCTTLDLEHIKKKRVEAIRGQILSKLRLTSPPE--SVGPAHVP 66 

Qy 75 EAVLALYNSTRDRVAGESVEPEPE PEADY YAKEVTRVLMVE SGNQI YDKF 124 

MINIMI: : | :| | | hMMIh = h= | = : 

Db 67 YQILALYNSTRELL- - EEMEEEKEESCSQENTESEYYAKEIHKFDMIQGLPEHNELGI CP 124 

Qy 125 KGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND 177 

II II I I I III IMh : M MIM I 

Db 125 KGVTSNVFR- FNVS SAEKNSTNLFRAEFRVLRVPNPSSKRSEQRI ELFQI LRPDEHI 180 

Qy 178 - SWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHCSC D 226 

MM I : MMIMMI MM I 1 1 = I Mill 

Db 181 AKQRYLSGRNVQTRGSPEWLSFDVTDTVREWLLHRESNLGLEISIHCPCHTFQPNGDILE 240 

Qy 227 SKDNTLHVE I NGFNSG RRGDLATI HGMNRPFLLLMATPLERAQH- -LHSSRHRR 278 

I :: I M MM : == I IMI I I = I I M 

Db 241 NLHEVLEI KFKGI DSEDDYGRGDLGRLKKQKDLHNPHLILMMLPPHRLESPTLGGQRKKR 300 

Qy 2 79 ALDTNSYPYDVPDYASLALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANF 33 8 

MMMM : IMMM 1 1 1 1 M M M 1 1 1 M 1 1 1 1 1 Ml 

Db 3 01 ALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYFANF 343 

Qy 33 9 CLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSN 3 98 

I MMh I II M M III M IMMMMI MM IMMM MMMM 

Db 344 CSGPCPYLRSADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSN 4 03 

Qy 3 99 MIVRSCKCS 4 07 

IMMIMI 

Db 4 04 MWKSCKCS 412 



RESULT 12 
B61036 

transforming growth factor beta-5 precursor - African clawed frog 
C; Species: Xenopus laevis (African clawed frog) 

C;Date: 31-Dec-1993 #sequence__revision 03-Feb-1994 #text_change 16-Jul-1999 
C;Accession: A34929; B61036 

R;Kondaiah, P.; Sands, M.J.; Smith, J.M.; Fields, A.; Roberts, A.B.; Sporn, 
M.B.; Melton, D . A . 

J. Biol. Chem. 265, 1089-1093, 1990 

A;Title: Identification of a novel transforming growth factor-beta (TGF-beta5) 
mRNA in Xenopus laevis. 



A;Reference number: A34929; MUI D : 90110090 ; PMID:2295601 
A;Accession: A34929 
A;Molecule type: mRNA 
A; Residues: 1-382 <KON> 

A;Cross-references: GB:J05180; NID:g214821; PIDN : AAA49968 . 1 ; PID:g214822 
R;Roberts, A.B.; Rosa, F . ; Roche, N.S.; Coligan, J.E.; Garfield, M. ; Rebbert , 
M.L.; Kondaiah, P.; Danielpour, D. ; Kehrl , J.H.; Wahl , S.M.; Dawid, I.B.; Sporn, 
M.B. 

Growth Factors 2, 135-147, 1990 

A; Title: Isolation and characterization of TGF-beta2 and TGF-beta5 from medium 
conditioned by Xenopus XTC cells. 

A;Reference number: A61036; MUID : 90253806 ; PMID:2340184 
A; Accession: B6103 6 
A;Molecule type: protein 

A;ResidueS: 271-276, 'X' , 278-284 , *XX' , 287-299 <ROB> 
C; Superf amily : inhibin 
C; Keywords: growth factor 

F; 271 -3 82 /Product : transforming growth factor beta-5 #status experimental <MAT> 

Query Match 4 0.9%; Score 8 8 7.5; DB 2; Length 382; 

Best Local Similarity 46.4%; Pred. No. 4.8e-64; 

Matches 192; Conservative 54; Mismatches 121; Indels 47; Gaps 11; 

Qy 9 LPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDV 68 

: M II I I I I I Mill :||| I :: = I I I I I I I I I I I I I : I I I 

Db 1 MEVXiWMLLVIjLVLHLSSLAMSLSTCKAVDMEEVRKRRI EAI RGQI LSKLKLDKTPDV- DS 59 

Qy 69 PPGPLPEAVLALYNSTRDRVAGESVEPE PEPEADYYAKEVTRVLMVESGNQI YDK 123 

M : Mill : : - | : MIMM I II - I 

Db 60 EKMTVPSEAIFLYNSTLEVIREKATREEEHVGHDQNIQDYYAKQVYR FESITELED- 115 

Qy 124 FKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKL--KVEQHVELYQKYSNDSW-- 179 

I II I Ml I II Mlh : : ==l MM II : 

Db 116 HEFKFKFNASHVRENVGMNSLLHHAELRMYKKQTDKNMDQRMELFWKYQENGTTH 170 

Qy 180 -RYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSC DSKDNTLHVE 235 

III = I hi, Ml I Ml II I I I II Ml 
Db 171 SRYLESKYITPVTDDEWMSFDVTKTVNEWLKRAEENEQFGLQPACKCPTPQAKD ID 226 

Qy 236 I NGFNSGRRGDLATI HGM - -NRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYA 2 93 

I II I I I I I : : MM- : | || : ||| :| 
Db 227 IEGF-PALRGDLASLSSKENTKPYLMITSMPAERIDTVTSSRKKR 270 

Qy 2 94 SLALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQY 3 53 

= III : Mlh I I I : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMII IMMIMIII 

Db 271 --GVGQEYCFGNNGPNCCVKPLYINFRKDLGWKWIHEPKGYEANYCLGNCPYIWSMDTQY 328 

Qy 354 SKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 407 

I I I I : I I I I H I I I I MMII I I I II I =! I I I i I 1 I I I I I I : I I I I || 
Db 329 SKVLSLYNQNN PGAS I S PCCVPDVLEPLP 1 1 YYVGRTAKVEQLSNM WRSCNCS 382 



RESULT 13 
S01825 

transforming growth factor beta-3 precursor - pig 
C;Species: Sus scrofa domestica (domestic pig) 

C;Date: 30-Sep-1989 #sequence_revis ion 30-Sep-1989 #text_change 16-Jul-1999 



C; Access ion: S01825 

R/Derynck, R.; Lmdquist, P.B.; Lee, A./ Wen, D. ; Tamm, J.; Graycar, J.L.; Rhee, 
L . ; Mason, A.J.; Miller, D.A.; Coffey, R.J.; Moses, H.L.; Chen, E.Y. 
EMBO J. 7, 3737-3743, 1988 

A;Title: A new type of transforming growth factor-beta, TGF-beta3 . 
A/Reference number: S01824; MUID : 89091120 ; PMID:3208746 
A; Accession: S01825 
A; Molecule type: mRNA 
A;Residues: 1-409 <DER> 

A; Cross-references : EMBL-.X14150; NID:g2127 ; PIDN : CAA32363 . 1 ; PID:g2128 
C; Superf amily : inhibin 
C; Keywords: growth factor 

F ; 1-25/ Doma in : signal sequence tfstatus predicted <SIG> 
F; 26-297/ Doma in : propeptide #status predicted <PR0> 

F/298-409/Product : transforming growth factor beta-3 #status predicted <MAT> 

Query Match 3 9.2%; Score 8 51; DB 2; Length 4 09; 

Best Local Similarity 44.6%; Pred. No. 4.8e-61; 

Matches 190; Conservative 58; Mismatches 122; Indels 56; Gaps 13; 



Qy 15 LLWLLVLTPGRPAAGLSTCKTIDMELVKRKRI EA1RGQILSKLRLASPPSQGDVPPGPLP 74 

h I • I : MM hi : : II II : II I II II II I II I Ml = M 

Db 7 LVVLALLNFATVSLSMSTCTTLDFDHI KRKRVEAI RGQI LSKLRLTSPPDPSML- -ANI P 64 

Qy 75 EAVLALYNSTR DRVAGESVE- - PEPEPEADYYAKEVTRVLMV ESGNQI YDKFKG 126 

II MINI : I II : ■ h:||llh I = I I = II 

Db 65 TQVLDLYNSTRELLEEVHGERGDDCTQENTESEYYAKEI YKFDMIQGLEEHNDLAVCPKG 124 

Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 

:: || | : : | ||| |:|h = II HMI I 

Db 125 1TSKIFR-FNVSSVEK NETNLFRAEFRVLRMPNPSSKRSEQRI ELFQILQPDEHIAK 180 

Qy 17 9 WRYLSNRLLAPSDS PEWLSFDVTG WRQWLTRREAI EGFRLSAHCSC DSK 22 8 

I M : I : I I ' , I I I I ||:|| |||: I HIM 

Db 181 QR Y I DGKNL PTRGAAEWLS FDVTDTVREWLLRRESNLGLE I S I HCPCHTFQ PNGD I LEN I 24 0 

Qy 229 DNTLHVEI NGFNS GRRGDLATI HGM - -NRPFLLLMATPLERAQH- - LHSSRHRRALD 281 

: =: I • I MM : = I MM I M • I • I M 
Db 241 QEVMEIKFKGVDSEDDPGRGDLGRLKKKKEHSPHLILMMI PPDRLDNPGLGAQRKKR 2 97 

Qy 2 82 TNSYPYDVPDYASLALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLG 341 

IMIIMI : MIIMI 'I I Mh I Mil h I ! I II hi II I I 
Db 2 98 ALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSG 343 

Qy 34 2 PCPY1WSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIV 4 01 

HIM I II M II III II MM II Ml I MM MIIMI I II M II I M 

Db 344 PCPYLRSADTTHSSVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTAKVEQLSNMW 403 

Qy 4 02 RSCKCS 4 07 

MUM 

Db 4 04 KSCKCS 4 09 



RESULT 14 
A41397 

transforming growth factor beta-3 precursor - mouse 
C; Species: Mus musculus (house mouse) 



C;Date: 03-Apr-1992 #sequence_revis ion 03-Apr-1992 #text_change 16-Jul-1999 
C;Accession: A41397; A61039; A61225 

R;Miller, D.A.; Lee, A.; Matsui, Y . ; Chen, E.Y.; Moses, H.L.; Derynck, R. 
Mol. Endocrinol. 3, 1926-1934, 1989 

A; Title: Complementary DNA cloning of the murine transforming growth factor- 

beta3 (TGFbeta3) precursor and the comparative expression of TGFbeta3 and 

TGFbetal messenger RNA in murine embryos and adult tissues. 

A;Reference number: A41397; MUID : 90190650 ; PMID:2628730 

A; Accession : A41397 

A; Molecule type: mRNA 

A;Residues: 1-410 <MIL> 

A; Cross-references : GB:M32745; NID:g201949; PIDN : AAA40422 . 1 ; PID:g201950 
R;Denhez, F . ; Lafyatis, R.; Kondaiah, P.; Roberts, A.B.; Sporn, M.B. 
Growth Factors 3, 139-146, 1990 

A;Title: Cloning by polymerase chain reaction of a new mouse TGF-beta, mTGF- 
beta3 . 

A;Reference number: A61039; MUID : 91000714 ; PMID:2206556 
A;Accession: A61039 
A; Molecule type: mRNA 
A;ResidueS: 1-410 <DEN> 

R;Watrin, F . ; Scotto, L. ; Assoian, R.K.; Wolgemuth, D.J. 
Cell Growth Differ. 2, 77-83, 1991 

A;Title: Cell lineage specificity of expression of the murine transforming 

growth factor beta-3 and transforming growth factor beta-1 genes. 

A;Reference number: A61225; MUID : 91299576 ; PMID: 2069871 

A; Access ion : A6122 5 

A; Status: translation not shown 

A; Molecule type: mRNA 

A; Residues: 285-410 <WAT> 

C;Superfamily : inhibin 

C; Keywords: glycoprotein; growth factor; growth regulation 
F; 1-21/Domain : signal sequence #status predicted <SIG> 
F;22-298/Domain: propeptide #status predicted <PRO> 
F;259-261/Region: cell attachment (R-G-D) motif 

F;299-410/Product : transforming growth factor beta-3 #status predicted <MAT> 
F; 72 , 133 , 140/Bmding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 38.9%; Score 844.5; DB 2; Length 410; 

Best Local Similarity 44.2%; Pred. No. 1.6e-60; 

Matches 188; Conservative 57; Mismatches 127,- Indels 53; Gaps 12; 

Qy 15 LLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLP 74 

1=1=1 : I I I I hi : h IN I I I I U I I I . I IN : 

Db 7 LVVLALLNLATISLSLSTCTTLDFGHIKKKRVEAIRGQILSKLRLTSPPEPSVMT- -HVP 64 

Qy 75 EAVLALYNSTR DRVAGESVE- - PEPEPEADYYAKEVTRVLMVE SGNQI YDKFKG 126 

II Mill II = : II I = l-IIMh : | = : |:= II 

Db 65 YQVLALYNSTRELLEEMHGEREEGCTQETSESEYYAKEI HKFDMI QGLAEHNELAVCPKG 124 

Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 

:: || I : : I III hlh : || :||:| | 

Db 125 I TSKVFR - FN VSSVEK NGTNLFRAEFRVLRVPNPSSKRTEQRI ELFQI LRPDEHI AK 180 

Qy 179 WRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSC DSK 228 

lh : I : llhllil ! I : ' I |lh I HIM 

Db 181 QRYIGGKNLPTRGTAEWLSFDVTDTVREWLLRRESNLGLEISIHCPCHTFQPNGDILENV 24 0 



Qy 229 DNTLHVE I NGFNS GRRGDLATI HGM NRPFLLLMATPLERAQHLHSSRHRRALDT 282 

: - I := I I I I : = I MM I I M 

Db 241 HEVMEI KFKGVDNEDDHGRGDLGRLKKQKDHHNPHLI LMMI PPHRLDSPGQGSQRK- - - - 296 

Qy 283 NSYPYDVPDYASLALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGP 342 

1 1 1 1 1 r r I : MINI I I I M h I II I I H II I ! I : I I I I ii 

Db 2 97 KRALDTNYCFRNLEENCCWPLYIDFRQDLGWKWVHEPKGYYANFCSGP 345 

Qy 343 CPYIWSLDTQYSKVIALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VR 4 02 

II h I I I :| I I Ml I I I M M I I I II I I I I hi MM I III || I I |: I : 
Db 34 6 CPYLRSADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWK 4 05 

Qy 4 03 SCKCS 4 07 

Mill 

Db 4 06 SCKCS 410 



RESULT 15 
WFMSB2 

transforming growth factor beta-2 precursor - mouse 
C; Species: Mus musculus (house mouse) 

C;Date: 30-Sep-1992 #sequence_revision 30-Sep-1992 #text_change 18-Jun-1999 
C; Access ion : A4 014 8 

R;Miller, D.A.; Lee, A.; Pelton, R.W.; Chen, E.Y.; Moses, H.L.; Derynck, R. 
Mol. Endocrinol. 3, 1108-1114, 1989 

A; Title: Murine transforming growth factor-beta2 cDNA sequence and expression in 
adult tissues and embryos. 

A;Reference number: A40148; MUID : 90014832 ; PMID:2797004 
A; Accession : A4 014 8 
A /Molecule type: mRNA 
A;ResidueS: 1-414 <MIL> 

A; Cross-references: EMBL:X57413; NID:g54772; PIDN:CAA40672 . 1; PID:g54773 
C; Comment: None of the three predicted glycosylation sites is in the mature 
protein . 

C; Superf amily : inhibin 

C;Keywords: glycoprotein; growth factor; growth regulation; homodimer; mitogen 
F ; 1 - 19 /Domain : signal sequence #status predicted <SIG> 
F;20-302/Domain: propeptide #status predicted <PRO> 

F; 3 03 -4 14 /Product : transforming growth factor beta-2 #status predicted <MAT> 
F ; 72, 140, 241/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 38.9%; Score 844; DB 1; Length 414; 

Best Local Similarity 44.0%; Pred. No. 1.8e-60; 

Matches 191; Conservative 61; Mismatches 120; Indels 62; Gaps 15; 
Qy 12 LLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVP-P 70 

M Mill MM 1 : 1 1 : 1 1, 1 M hi III MM III Ml 

Db 5 VLSTFLLLHLVP - - VALSLSTCSTLDMDQFMRKR I EAI RGQI LSKLKLTSPPE - - DYPEP 60 

Qy 71 GPLPEAVLALYNSTRD RVAGESVEPEPE- PEADYYAKEVTRVLM- - -VESGNQI YD 122 

M MMIIIM : : : III MIIMI == I MM 

Db 61 DEVPPEVISI YNSTRDLLQEKASRRAAACERERSEQEYYAKEVYKIDMPSHLPSENAI PP 120 

Qy 123 KFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKV- EQHVELYQ K 173 

I M MM : : MM MM I M II Mill 

Db 121 TFY - R P Y FR I VRFDVSTMEKNASN LVKAEFR VFRLQN PKARVAEQR I EL YQ I LKS KD 176 



Qy 174 YSNDSWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHCSC 225 

- : 11= : = = = llllllll h = I I - lh=l II I 

Db 177 LTSPTQRYIDSKWKTRAEGEWLSFDVTDAVQEWLHHKDRNLGFKISLHCPCCTFVPSNN 236 

Qy 226 DSKDNTLHVE INGFNSGRRGDLATIHGMNR PFLLLMATPLERAQHLHS 273 

M I hi := MM: I I I I I I I : I 

Db 237 YIIPNKSEELEARFAGIDGTSTYASGDQKTIKSTRKKTSGKTPHLLLMLLPSYRLESQQS 296 

Qy 274 SRHRRALDTNSYPYDVPDYASLALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKG 333 

M - Ml Ml : : I I h I MMMMMIMMIMI 

Db 297 SRRKKR ALDAAYCFRNVQDNCCLR PLY I DFKRDLGWKW I HE PKG 340 

Qy 334 YHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKV 3 93 

■■II I ! 1 1 : 1 ! 1 1 1 : M 1 1 • 1 1 1 II MI-MI I MM rMM lh 

Db 341 YNANFCAGACPYLWSSDTQHTKVLSLYNTINPEASASPCCVSQDLEPLTILYYIGNTPKI 400 

Qy 394 EQLSNMIVRSCKCS 407 

I I I I I I I h I I I I I 
Db 4 01 EQLSNMIVKSCKCS 414 



Search completed: October 28, 2003, 09:09:50 
Job time : 14.4491 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: October 28, 2003, 00:08:41 ; Search time 8.15764 Seconds 

(without alignments) 
2346.251 Million cell updates/sec 

Title: US-10-017-372E-39 
Perfect score: 2169 

Sequence: 1 MAPSGLRLLPLLLPLLWLLV GRKPKVEQLSNM I VRSCKCS 407 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 127863 seqs, 47026705 residues 

Total number of hits satisfying chosen parameters: 127863 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : SwissProt 41:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 



LNO . 


Score 


Match Length 


Do 


T 'Pi 


Description 


1 
X 


9 rm Pi r 

Z U 9 U . D 


94 


.5 


390 


X 


TP Wl DTP 
lur X rib 


P07200 


sus scrofa 


z 


1 Q4 Q C 


89 


. 9 


390 


1 


TO 17 1 CULTD 

Ibr X brlbhr 


P50414 


ovis aries 


9 


iyjb - 9 


89. 


.3 
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1 


Ibr 1 HUMAN 


P01137 
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1 Q T A c: 


89 


.2 
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1 
1 
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1 Q 1 9 c; 
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1 
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TP P 1 pjmda 
1 br X bAINr A 
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1QCQ C 

1 0 9 y . 9 


85 


. 7 
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1 
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85 
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1 
1 


TPDI MHI1CU 

J. br X MUUbh 
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mus musculu 


p 
O 


1 pec c 
X 0 9 9 . 9 


85 


.5 
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1 


TP P 1 DAT 
I br X KA 1 


P17246 


rattus norv 


q 


1 P A £ t; 
1 0 r± O . 9 
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. 1 
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1 
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1 n 
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"1 R Q 9 R 
1 9 i?Z . 9 


73 . 


.4 
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1 
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Ibr X oUVl fvl 
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bos taurus 


1 1 
x x 


1 D£7 
1 VJ O / 


49 


.2 
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1 
x 
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gallus gall 


X Z 


P P 7 c: 
O O / . 9 


40 


. 9 


382 


1 
1 


ibr X ArjiNXA 


P16176 


xenopus lae 


-L 9 


ft A Q R 
0 D -7 . 3 


40 


. 1 


412 


1 
1 


TPP9 nuTPi/' 


P16047 


gallus gall 


J. *± 


O 3 X 


39 , 


,2 


409 


1 

X 


TP PI DTP 
1 br 9 rib 


P15203 


sus scrofa 


1 R 
X D 


QAA tr 


38 , 


.9 


410 


X 


TP 17 7 MPTTCP 
Ibr J I'lUUbti 


P17125 


mus musculu 


1 

J. D 


ft 4 4 
Oil 


38 . 


. 9 


414 


X 


TPP9 MPTTCP 
IbrZ l v JUUi)D 


P27090 


mus musculu 


1 7 


P A 9 

O t 9 . 9 


38 


. 9 


412 


X 


TP P 9 DAT 
1 br 9 K>\ i. 


Q07258 


rattus norv 


1 p 


P A 1 R 
O 1 X . 3 


38 


.8 


412 


X 


TP PI UTTMAM 

Ibr J nUNAJNI 


P10600 


homo sapien 




P 9 Q 
0 Z j 


38 , 


.2 


414 


1 
X 


TPPO UTTMAM 

IbrZ rlUMAJNI 


P08112 


homo sapien 


9 0 
Z U 


P 9 R R 
O Z O . 9 


38 . 


.2 


413 


X 


TPPO YPT\TT A 

IbrZ AbiNXaA. 


P17247 


xenopus lae 


9 1 

Z X 


O Z 3 


38 . 


. 0 


412 


1 
1 


TPP9 PUTPV 
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P30371 
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9 9 


ft 9 R 
O Z 3 


38 . 


,0 
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1 
X 


TPP9 DTP 
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Z J 


0 Z 9 
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. 0 
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1 
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9 A 
Z ft 


4 ft 9 
1 O Z 


22 . 


,2 
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X 
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9 

Z 9 


9 9 9 R 


14 


. 9 
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1 
X 


phpp mpt pa 
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042221 
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9£ 
Z D 


19 1 R 


14 , 


. 8 
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I 
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042220 


gallus gall 


9 7 
Z / 


7 19 R 
9 XZ . 3 


14 . 


.4 


375 


X 


phpp dtp 

bUr 0 rib 


018831 


sus scrofa 


9 P 
Z 0 


9 1 x . 3 


14 . 


.4 


375 


1 


bDr 0 rArnA 


018828 


papio hamad 


9 Q 

z y 


9 U • 3 


14 


.3 


376 


1 


riniT Q DAT 

bDr 0 KA1 


035312 


rattus norv 


9 u 


1 Aft c; 


14 . 


.2 


375 


1 
1 


pnUQ U1TMAM 

bDr 0 nUMAiNI 


014793 


homo sapien 


J 1 


1 r\ 7 c; 
9 U / . 3 
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.2 


376 
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1 


PPlPP MPT TCP 
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9 9 
J z 
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.1 


375 


X 


PHPP CUPPD 

bi9r 0 oncibr 


018830 


ovis aries 


33 


2 98 


13 . 


, 7 


405 


1 


PDFR MOTT^F 

\JLJl O 1 W O Hi 


Q9zlw4 


mus musculu 


34 


298 


13 . 


, 7 


407 


1 


GDFB~HUMAN 


095390 


homo sapien 


35 


296.5 


13 . 


. 7 


375 


1 


GDF8 BOVIN 


018836 


bos taurus 


36 


280.5 


12 . 


. 9 


374 


1 


GDF8__BRARE 


042222 


brachydanio 


37 


276.5 


12 . 


.7 


345 


1 


GDFB_RAT 


Q9z217 


rattus norv 


38 


275 


12 . 


7 


425 


1 


IHBA_SHEEP 


P43032 


ovis aries 


39 


271 .5 


12 . 


. 5 


426 


1 


IHBA HORSE 


P55102 


equus cabal 


40 


270.5 


12 . 


5 


424 


1 


IHBA_PIG 


P03970 


sus scrofa 


41 


270 


12 . 


.4 


425 


1 


IHBA_BOVIN 


P07995 


bos taurus 


42 


268 .5 


12 . 


4 


424 


1 


IHBA_MOUSE 


Q04998 


mus musculu 


43 


268 .5 


12 . 


. 4 


424 


1 


IHBA RAT 


P18331 


rattus norv 


44 


265.5 


12 . 


2 


424 


1 


IHBA_CHICK 


P27092 


gallus gall 


45 


263 


12 . 


1 


424 


1 


BM10_HUMAN 


095393 


homo sapien 



ALIGNMENTS 



RESULT 1 



TGF1_PIG 

ID TGF1_PIG STANDARD; PRT; 3 90 AA. 

AC P07200; P08832; 

DT 01-APR-1988 (Rel . 07, Created) 

DT 01-APR-1988 (Rel. 07, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Sus scrofa (Pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetart iodactyla ; Suina; Suidae; Sus. 

OX NCBI_TaxID=9823 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Ovary ; 

RX MEDLINE=87174844 ; PubMed=3470708 ; 

RA Derynck R . , Rhee L . ; 

RT "Sequence of the porcine transforming growth factor-beta precursor."; 

RL Nucleic Acids Res. 15:3187-3187(1987). 

RN [2] 

RP SEQUENCE FROM N.A., AND VARIANT VAL-114. 

RC STRAIN=Miniature swine; 

RX MEDLINE=89054010; PubMed=2461367 ; 

RA Kondaiah P., van Obberghen-Schil 1 ing E. , Ludwig R.L., Dhar R., 

RA Sporn M.B., Roberts A.B.; 

RT 11 cDNA cloning of porcine transforming growth factor-beta 1 mRNAs . 

RT Evidence for alternate splicing and polyadenylat ion . " ; 

RL J. Biol. Chem. 263:18313-18317(1988). 

RN [3] 

RP SEQUENCE FROM N.A., AND VARIANT VAL-114. 

RX MEDLINE=88335639; PubMed=3 16652 0 ; 

RA Jakowlew S.B., Dillard P.J., Sporn M.B., Roberts A.B.; 

RT "Nucleotide sequence of chicken transforming growth factor-beta 1 

RT (TGF-beta 1) 

RL Nucleic Acids Res. 16:8730-8730(1988). 

RN [4] 

RP SHOWS THAT REF . 3 SEQUENCE IS FROM PIG. 

RA Jakowlew S . B . ; 

RL Unpublished observations (MAR-1996) . 

RN [5] 

RP SEQUENCE FROM N.A. , AND VARIANT VAL-114. 

RA Wimmers K. , Chomdej S., Ponsuksili S., Schellander K. ; 

RT "Polymorphism in the porcine transforming growth factor beta 1 

RT gene . " ; 

RL Submitted (DEC-2001) to the EMBL/GenBank/DDB J databases. 

RN [6] 

RP SEQUENCE OF 279-322. 

RX MEDLINE=87102890; PubMed=2 8 79635 ; 

RA Cheifetz S., Weatherbee J. A. , Tsang M.L.S., Anderson J.K., Mole J.E., 

RA Lucas R., Massague J.; 

RT "The transforming growth factor-beta system, a complex pattern of 

RT cross -reactive ligands and receptors."; 

RL Cell 48:409-415(1987). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 
CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 



CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS . 

CC -!- SUBUNIT: Homodimer; disulfide-1 inked . 

CC -!- SUBCELLULAR LOCATION : Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC -!- CAUTION: REF.3 SEQUENCE WHICH WAS SAID TO ORIGINATE FROM CHICKEN 
CC WHITE LEGHORN, SEEMS (REF.4) TO ORIGINATE FROM PIG. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; Y00111; CAA68291.1, 

DR EMBL; M23703; AAA64616.1, 

DR EMBL; X12373; CAA30933.1; 

DR EMBL; AF461808; AAL57902.1; 

DR PIR; A27512; A27512 . 

DR HSSP; P01137; 1KLA . 

DR InterPro; IPR002400; GF__cysknot . 

DR InterPro; IPR003911; TGF_TGFb . 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide; 



DR 


PRINTS; 


PR00438; 


GFCYSKNOT. 




DR 


PRINTS; 


PR01423; 


TGFBETA . 




DR 


ProDom; 


PD000357; 


TGFb; 1. 




DR 


SMART; SM00204; TGFB; 1. 




DR 


PROSITE; 


PS00250; 


TGF BETA 1; 


1 . 


KW 


Growth factor; Mitogen; Glycoprotein; Signal; Polymorphism. 


FT 


SIGNAL 


1 


23 


POTENTIAL. 


FT 


PROPEP 


24 


278 




FT 


CHAIN 


279 


390 


TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


285 


294 


BY SIMILARITY. 


FT 


DISULFID 


293 


356 


BY SIMILARITY. 


FT 


DISULFID 


322 


387 


BY SIMILARITY. 


FT 


DISULFID 


326 


389 


BY SIMILARITY. 


FT 


DISULFID 


355 


355 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 


N-LINKED (GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT SITE (POTENTIAL) . 


FT 


VARIANT 


114 


114 


L -> V. 


FT 


CONFLICT 


6 


7 


LR -> PG (IN REF. 3) . 


FT 


CONFLICT 


180 


180 


R -> G (IN REF. 3) . 


FT 


CONFLICT 


237 


237 


N -> NA (IN REF. 3) . 


SQ 


SEQUENCE 


390 AA; 44294 MW; 


A6E2C3659FC384E6 CRC64 ; 



Query Match 94 . 5%; 

Best Local Similarity 95.3%; 
Matches 388; Conservative 



Score 2050.5; DB 1; Length 390; 
Pred. No. 6.5e-157; 
1; Mismatches 1; Indels 17; 



Gaps 



1; 



Qy 



1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 



I IMIIIIMIMMIMIIIIIIIIMIIIIIIIIIIMIMIIIIIIIIIIIIIIII 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

II MM Ml II I I , I i I ! I I I I 1 I : i I I : ! I I I I I M I : ' I I I . I 

Db 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMLESGNQI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

I ! I'l II I M , T I M M I II II 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 ' L M 1 1 1 1 1 1 II I Ml 

Db 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHCSCDSKDNTLHVEINGFN 240 

11:1,11 II lllli l.llllll III I M llli'l.MI. IMIIMI II : MI ; | 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHCSCDSKDNTLHVEINGFN 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 300 

1 1 1 M M 1 1 1 1 , I M M M I 1 1 1 M 1 1 1 1 MMI 

Db 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRR ALDTN 283 

Qy 301 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

. 1 1 I i 1 1 M 1 1 I ! 1 1 I ' I ; I I I 1 1 ^ > I I 1 1 ! M II I I 1 1 I 1 1 1 

Db 284 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 

Qy 361 NQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 4 07 

MM M I M 1 1 II II M IM II 1 1 M II II I II 1 1 1 1 1 II I M II M 

Db 344 NQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 390 

RESULT 2 
TGF1_SHEEP 

ID TGF1_SHEEP STANDARD; PRT; 3 90 AA . 

AC P50414; 

DT 01-OCT-1996 (Rel . 34, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1. 

OS Ovis aries (Sheep) . 

OC Eukaryota; Metazoa; Chorda ta ; Craniata; Vertebra ta; Euteleostomi ; 

OC Mammalia; Eutheria; Cetart lodactyla ; Ruminantia; Pecora; Bovoidea; 

OC Bovidae; Caprinae; Ovis. 

OX NCBI_TaxID=994 0; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=95121932; PubMed=782 1809 ; 

RA Woodall C.J., McLaren L. J. , Watt N.J.; 

RT "Sequence and chromosomal localisation of the gene encoding ovine 

RT latent transforming growth factor-beta 1."; 

RL Gene 150:371-373(1994). 

RN [2] 

RP SEQUENCE OF 281-390 FROM N.A. 

RC STRAIN=Merino ; TISSUE=Skin ; 

RX MEDLINE=95268698; PubMed=774 962 1 ; 

RA Sutton R., Ward W.G., Raphael K.A. , Cam G.R.; 

RT "Growth factor expression in skin during wool follicle development."; 

RL Comp. Biochem. Physiol. HOB : 697-705 ( 1995) . 

CC FUNCTION : TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 



CC PROLIFERATION , DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS , 

CC -!- SUBUNIT: Homodimer; disul fide- 1 inked . 

CC -I- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformat ics and the EMBL outstation - 

CC the European Bioinformat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; X76916; CAA54242.1; -. 

DR EMBL; L36038; AAA31526.1; -. 

DR PIR; 146463; 146463. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; IPR003 911; TGF_TGFb. 

DR InterPro; I PRO 0183 9; TGFb . 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD00 03 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA__1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 



FT 


SIGNAL 


1 


23 


POTENTIAL. 


FT 


PROPEP 


24 


278 


POTENTIAL. 


FT 


CHAIN 


279 


390 


TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


285 


294 


BY SIMILARITY. 


FT 


DISULFID 


293 


356 


BY SIMILARITY. 


FT 


DISULFID 


322 


387 


BY SIMILARITY. 


FT 


DISULFID 


326 


389 


BY SIMILARITY. 


FT 


DISULFID 


355 


355 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 


N-LINKED ( GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


390 AA; 


442$ 


)1 MW; 1C247299484D0E57 CRC64 ; 



Query Match 8 9.9%; Score 1949.5; DB 1; Length 390; 

Best Local Similarity 90.2%; Pred. No. 8.1e-149; 

Matches 367; Conservative 10; Mismatches 13; Indels 17; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I llllllllllllllllhllllll IIIIIIIIMIIIIIII lllllllllllllll 
Db 1 MPPSGLRLLPLLLPLLWLLMLTPGRPVAGLSTCKTI DM ELVKR KG I EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 'Ih 1 II I' 1 1 II 1 1 II ' I .'1 1 1 1 ,11 1 1 II II MM ' I MM 



Db 



61 SPPSQGDVPPGPLPEAIIALYNSTRDRVAGESAETEPEPEADYYAKEVTRVLMVEYGNKI 120 



Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


284 


Qy 


361 


Db 


344 



M MM II I MM 1 1 1 M M I III II MM II 1 1 1 1 hi Ml M II 1 1 1 1 1 hi 



SGRRGDIATIHGMNRPFLLLMATPLER&QHLHSSRHRRALDTNSYPYDVPDYASLALDTN 3 00 

II M I M I M Ml I M M M M 1 1 III M 1 1 1 M 1 1 1 Mill 

SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRR ALDTN 283 



I II M II II II II M II I II 1 1 1 1 M I II II 1 1 1 1 1 M I M II 1 1 M 

YCFSSTEKNCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW! 
NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 

M I M I IMM II II M 1 1 II II I MM II M M 1 1 1 M I MMM I 



RESULT 3 
TGF INHUMAN 

ID TGF1_HUMAN STANDARD; PRT; 3 90 AA . 

AC P01137; Q9UCG4; 

DT 21-JUL-1986 (Rel . 01, Created) 

DT 01-FEB-1991 (Rel. 17, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 OR TGFB . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCB I_Tax I D= 9 6 0 6 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=87174845; PubMed=3470709 ; 

RA Derynck R., Rhee L. , Chen E.Y., van Tilburg A.; 

RT "Intron-exon structure of the human transforming growth factor-beta 

RT precursor gene . 11 ; 

RL Nucleic Acids Res. 15:3188-3189(1987). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=85296301; PubMed=386194 0 ; 

RA Derynck R., Jarrett J. A. , Chen E.Y., Eaton D.H., Bell J.R., 

RA Assoian R.K., Roberts A.B., Sporn M.B., Goeddel D.V. ; 

RT "Human transforming growth factor-beta complementary DNA sequence and 

RT expression in normal and transformed cells."; 

RL Nature 316:701-705(1985). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Duodenum, and Eye; 

RX MEDLINE=223 8 8257; PubMed= 124 77 932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L. , Shenmen CM. , Schuler G.D., 



RA Altschul S.F., Zeeberg B. # Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H. , Moore T. , Max S.I., Wang J., Hsieh P., 

RA Diatchenko L., Marusina K. , Farmer A.A. , Rubin G.M., Hong L., 

RA Stapleton M . , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Browns tein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C. , 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L. J. , Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E. , Ketteman M., Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A . , Young A.C. , Shevchenko Y. , Bouffard G.G. , 

RA Blakesley R.W., Touchman J.W. , Green E.D., Dickson M.C., 

RA Rodriguez A.C, Grimwood J., Schmutz J . , Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [4] 

RP SEQUENCE OF 279-390 FROM N.A. 

RC TISSUE=Carcinoma ; 

RA Urushizaki Y., Niitsu Y., Terui T. , Koshida Y., Mahara K. , Kohgo Y., 

RA Urushizaki I., Takahashi Y., Ito H. ; 

RT "Cloning and expression of the gene for human transforming growth 

RT factor-beta in Escherichia coli." ; 

RL Tumor Res. 22:41-55(1987). 

RN [5] 

RP SEQUENCE OF 279-329. 

RC TlSSUE=Bladder carcinoma; 

RX MEDLINE= 9322 9900; PubMed=847 184 6 ; 

RA Bourdrel L. , Lin C.-H., Lauren S.L., Elmore R.H., Sugarman B.J., 

RA Hu S., WestCOtt K.R.; 

RT "Recombinant human transforming growth factor-beta 1: expression by 

RT Chinese hamster ovary cells, isolation, and characterization."; 

RL Protein Expr. Purif. 4:130-140(1993). 

RN [6] 

RP SEQUENCE OF 279-301. 

RX MEDL1NE=85131019; PubMed=2982 82 9 ; 

RA Massague J . , Like B. ; 

RT "Cellular receptors for type beta transforming growth factor. Ligand 

RT binding and affinity labeling in human and rodent cell lines."; 

RL J. Biol. Chem. 260:2636-2645(1985). 

RN [7] 

RP STRUCTURE BY NMR OF 2 79-390. 

RX MEDLINE=93144319; PubMed=8424 942 ; 

RA Archer S.J., Bax A., Roberts A.B., Sporn M.B., Ogawa Y., Piez K.A. , 

RA Weatherbee J. A. , Tsang M.L.-S., Lucas R., Zheng B.-L., Wenker J., 

RA Torchia D. A. ; 

RT "Transforming growth factor beta 1: NMR signal assignments of the 

RT recombinant protein expressed and isotopically enriched using Chinese 

RT hamster ovary cells."; 

RL Biochemistry 32:1152-1163(1993). 

RN [8] 

RP STRUCTURE BY NMR OF 279-390. 

RX MEDLINE=93144320; PubMed=8424 943 ; 

RA Archer S.J., Bax A., Roberts A.B., Sporn M.B., Ogawa Y., Piez K.A. , 

RA Weatherbee J. A., Tsang M.L.-S., Lucas R., Zheng B.-L., Wenker J., 



RA Torchia D. A. ; 

RT "Transforming growth factor beta 1: secondary structure as determined 

RT by heteronuclear magnetic resonance spectroscopy . " ; 

RL Biochemistry 32:1164-1171(1993). 

RN [9] 

RP STRUCTURE BY NMR OF 279-3 90. 

RX MEDLINE=96266150 ; PubMed=8679613 ; 

RA Hinck A.P. , Archer S.J., Qian S.W., Roberts A.B., Sporn M.B., 

RA Weatherbee J. A., Tsang M.L.-S., Lucas R., Zheng B.-L., Wenker J . , 

RA Torchia D . A . ; 

RT "Transforming growth factor beta 1: three-dimensional structure in 

RT solution and comparison with the X-ray structure of transforming 

RT growth factor beta 2 . " ; 

RL Biochemistry 35:8517-8534(1996). 

CC -!- FUNCTION: MULTIFUNCTIONAL PEPTIDE THAT CONTROLS PROLIFERATION, 
CC DIFFERENTIATION , AND OTHER FUNCTIONS IN MANY CELL TYPES . MANY 

CC CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF THEM HAVE 

CC SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES A 

CC POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul fide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; X05839; CAA29283.1; -. 

DR EMBL; X0584 0; CAA2 9283.1; JOINED. 

DR EMBL; X05843; CAA29283.1; JOINED. 

DR EMBL; X05844; CAA29283.1; JOINED. 

DR EMBL; X05849; CAA29283.1; JOINED. 

DR EMBL; X05850; CAA29283.1; JOINED. 

DR EMBL; X02812; CAA26580.1; ALT_SEQ . 

DR EMBL; BC001180; AAH01180.1; -. 

DR EMBL; BC000125; AAH00125.1; -. 

DR EMBL; BC022242; AAH22242.1; -. 

DR EMBL; M38449; AAA36735.1; -. 

DR PIR; A27513; WFHU2 . 

DR PDB; 1KLA; 17-AUG-96. 

DR PDB; 1KLC; 17-AUG-96. 

DR PDB; 1KLD; 17-AUG-96. 

DR Genew; HGNC : 11766; TGFB1 . 

DR MIM; 190180; -. 

DR GO; GO:0006916; P : ant i -apoptos is ; TAS . 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro ; I PRO 03911; TGFJTGFb. 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 



DR 


PRINTS; PR01423; 


TGFBETA - 




DR 


ProDom; 1 


PD000357; 


TGFb; 1. 




DR 


SMART ; SM00204; TGFB; 1. 




DR 


PROSITE; 


PS00250; 


TGF_BETA_1 ; 


1. 


KW 


Growth factor; Mitogen; Glycoprotein; Signal; 3D-structure . 


FT 


S I GNAL 


1 


23 


POTENTIAL . 


FT 


PROPEP 


24 


278 




FT 


CHAIN 


279 


390 


TRANSFORMING GROWTH FACTOR BETA 1 . 


FT 


DISULFID 


285 


294 




FT 


DISULFID 


293 


356 




FT 


T~> T A T TT T""> T T-\ 

DISULFID 


322 


387 




FT 


DISULFID 


326 


38 9 




FT 


DISULFID 


355 


355 


TXTmnnAlTH T XT 

INTERCHAIN . 


FT 


CARBOHYD 


82 


82 


XT T TXTI/Pn /AT AkT T» A \ / r>^ r Pr , M r P T TV T \ 

N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


XT T TXTT/FTv /AT AXTTi A \ / nATPXTT 1 T 7\ T \ 

N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


X T T T\II/ T~l T~\ /AT ATT 7\ A \ / T"^ A.H~I FIMT X T\ T \ 

N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


244 


246 


Am T 7\ mm t\ AT TR/T nurm A T* rr-| 7""i / riATPXirn T 7\ T \ 

CELL ATTACHMENT SITE (POTENTIAL) . 


FT 


CONFLICT 


10 


10 


T T""l / T XT D OO O ^ 

L - > P (IN REF . 2 ) . 


FT 


CONFLICT 


159 


159 


t~i n n / t it t~» t—i r~» \ 

R - > RR ( IN REF . 2 ) . 


FT 


STRAND 


2 81 


281 




FT 


TURN 


282 


287 




FT 


STRAND 


294 


296 




FT 


STRAND 


300 


300 




FT 


TURN 


302 


305 




FT 


STRAND 


313 


313 




FT 


STRAND 


317 


317 




FT 


STRAND 


321 


323 




FT 


HELIX 


335 


346 




FT 


TURN 


348 


349 




FT 


STRAND 


358 


370 




FT 


TURN 


371 


372 




FT 


STRAND 


373 


387 




SQ 


SEQUENCE 


390 AA 


.; 44341 MW; 


75391614250288FE CRC64 ; 



Query Match 89.3%; Score 1936.5; DB 1, 

Best Local Similarity 90.2%; Pred. No. 8.9e-148; 
Matches 367; Conservative 10; Mismatches 13; 



Length 3 90; 
Indels 17; Gaps 



1/ 



Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 



1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I MUM M M Ml M| 1 1, 1 1 1 1 1| [|| M 1 1 1 1 1 |M| 1 1 1 M MM I M M| | [ 

1 MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 
61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

lllllhlllllllllllllllllllllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 = hi 

61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 
121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

Mill : 11-11 M M I M M M M M Ml M 1 1 II I MM II MM M 1 1 1 1 1: 1 M 

121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHCSCDSKDNTLHVEINGFN 240 

I I I M I I I I I I I I I I I I I II I I I I I I II : I I I I I II I I II II I I : I I I I MINI 
181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240 

241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 3 00 

:||IIIIIIIIIIMIIIIIIIIIIIIIIIMIIIII Mill 
241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTN 283 



Qy 3 01 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

llllllllMIII! IIIIMIIII IM llhlLi l llllllllllllllllll 

Db 284 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 

Qy 3 61 NQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 4 07 

I I I , I M I II I I I I II I I : II 1 1 1 1 II I I '1 1 1 : 1 I I M I I I I I I I 
Db 344 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 4 
TGF1 CERAE 



ID TGF1_CERAE STANDARD; PRT; 3 90 AA. 

AC P09533; 

DT 01-MAR-1989 (Rel . 10, Created) 

DT 01-MAR-1989 (Rel. 10, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Cercopithecus aethiops (Green monkey) (Grivet) . 

OC Eukaryota; Metazoa; Chorda t a; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Cercopithecus . 

OX NCBI_TaxID=9534 ; 

RN [1] 

RP SEQUENCE FROM N . A . 

RX MEDLINE=87246074; PubMed=34 74 13 0 ; 

RA Sharpies K. , Plowman G.D., Rose T.M., Twardzik D.R., Purchio A.F.; 

RT "Cloning and sequence analysis of simian transforming growth 

RT factor-beta cDNA." ; 

RL DNA 6 : 239-244 (1987) . 

CC -!- FUNCTION: MULTIFUNCTIONAL PEPTIDE THAT CONTROL PROLIFERATION, 
CC DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL TYPES. MANY 

CC CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM HAVE 

CC SPECIFIC RECEPTORS FOR THIS PEPTIDE . TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES A 

CC POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulf ide-linked. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; M16658; AAA35369.1; -. 

DR PIR; A26960; A26960. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR003911; TGF__TGFb . 

DR InterPro; IPR00183 9; TGFb. 

DR InterPro; IPR001111; TGFb__N . 

DR Pfam; PF00019; TGF-beta; 1. 



DR Pfam; PF00688; TGFb_propept ide; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR PRINTS; PR01423 ; TGFBETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 
DR 
KW 
FT 



FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



PROSITE; 


PS00250; 


TGF_BETA_1 ; 


1. 


1 Growth factor; Mitogen; 


Glycoprotein; Signal . 


SIGNAL 


1 


16 




POTENTIAL . 


PROPEP 


17 


278 






CHAIN 


279 


390 




TRANSFORMING GROWTH FACTOR BETA 1. 


DISULFID 


285 


294 




BY SIMILARITY. 


DISULFID 


293 


356 




BY SIMILARITY. 


DISULFID 


322 


387 




BY SIMILARITY. 


DISULFID 


326 


389 




BY SIMILARITY. 


DISULFID 


355 


355 




INTERCHAIN (BY SIMILARITY) . 


CARBOHYD 


82 


82 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


CARBOHYD 


136 


136 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


CARBOHYD 


176 


176 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


SITE 


244 


246 




CELL ATTACHMENT SITE (POTENTIAL) . 


) SEQUENCE 


390 AA; 


44356 MW; 


DFF63E2BAB44320E CRC64 ; 


Query Match 




89 


.2%; 


Score 1934.5; DB 1; Length 390; 



Best Local Similarity 89.9%; Pred. No. 1.3e-147; 

Matches 366; Conservative 10; Mismatches 14; Indels 17; Gaps 1; 

MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I lllllllllllllllllllll lllllllllllllllllllllll llllllllllll 
MPPSGLRLLPLLLPLLWLLVLTPSRPAAGLSTCKTI DMELVKRKRI ETI RGQI LSKLRLA 60 

SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I I ! I I ' : | ! I I II || ' II I | | '| I | | | | | | | MM I MM III M I ' M II M h| 
SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : I'M II II MM MM M I M I M I III M IM M II 1 1 II 1 1 M IM II 

YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 
YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHCS CDS KDNTLHVE I NGFN 24 0 

Illllllllhllllllllllllllllhl MMMIMMMIMM hllll 

YLSNRLLAPSNS PEWLSFDVTG WRQWLSRGGE I EGFRLSAHCSCDS KDNTLQVDI NGFT 24 0 
SGRRGDLATI HGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 3 00 

M 1 1 II I II 1 1 1 1 II 1 1 1 II M 1 1 II II I M II 1 1 1 1 Mill 

TGRRGDLAT I HGMNR P FLLLMATPLERAQHLQS SRHRR ALDTN 283 

YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

I I M M I II M M II I M I M I MM II 1 1 II M II M Ml II I IM I MM 1 1 1 MM II 

YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 
NQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 4 07 

II II I M IM II Ml I IM II II II MM II II 1 1 1 M I II MM I M 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


284 


Qy 


361 


Db 


344 



RESULT 5 
TGF1_CANFA 

ID TGF1_CANFA STANDARD; PRT; 3 9 0 AA . 



AC P54831; 

DT 01-OCT-1996 (Rel . 34, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Canis familiaris (Dog) . 

OC Eukaryota; Metazoa; Chordata; Crania ta; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis. 

OX NCBI_TaxID=9615; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE= Jugular vein endothelial; 

RX MEDLINE=95237630; PubMed=772 1 1 1 0 ; 

RA Manning A.M., Auchampach J. A. , Drong R.F., Slightom J.L. ; 

RT "Cloning of a canine cDNA homologous to the human transforming growth 

RT factor-beta 1 -encoding gene."; 

RL Gene 155:3 07-3 08(1995). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul f ide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformat ics and the EMBL outstation - 

CC the European Biomf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; L34956; AAA51458.1; -. 

DR PIR; JC4023; JC4023. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR003911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propeptide; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR PRINTS; PR01423; TGFBETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM0 02 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT SIGNAL 1 23 POTENTIAL. 

FT PROPEP 24 278 BY SIMILARITY. 

FT CHAIN 279 390 TRANSFORMING GROWTH FACTOR BETA 1. 

FT DISULFID 285 294 BY SIMILARITY. 

FT DISULFID 293 356 BY SIMILARITY. 

FT DISULFID 322 387 BY SIMILARITY. 



r 1 DISULir ID 


326 


389 


BY SIMILARITY. 




FT DISULFID 


355 


355 


INTERCHAIN. 




FT CARBOHYD 


82 


82 


N- LINKED (GLCNAC. . . 


) (POTENTIAL) 


FT CARBOHYD 


136 


136 


N- LINKED (GLCNAC. . . 


) (POTENTIAL) 


FT CARBOHYD 


176 


176 


N -LINKED (GLCNAC. . . 


) (POTENTIAL) 


FT SITE 


244 


246 


CELL ATTACHMENT SITE 


(POTENTIAL) . 


SQ SEQUENCE 


390 AA; 


44185 MW; EB4780E88B7B590E CRC64; 


Query Match 




88.2%; 


Score 1913.5; DB 1; 


Length 3 90; 


Best Local Similarity 


89.2%; 


Pred. No. 6.2e-146; 




Matches 363; 


Conservative 


12; Mismatches 15; 


Indels 17; 



Qy 

Db 
Qy 

Db 

Qy 

Db 

Qy 
Db 



Gaps 1 ; 

1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I 1 1 ! 1 1 1 1 ! 1 1 1 1 M lllllll IMII III I II llllllll III MUM I'M Mr 

1 MPPSGLRLLPLLLPLLRLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLS 60 
61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

MMMMM MMMMMMMMMMI 1 1 M 1 1 1 1 M I M 1 1 1 1 1 1 1 1 h hi 

61 SPPSQGEVPPVPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVENTNKI 12 0 
121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

hi I MIMMMIMMMMMMM MMMMMMMM II I II 1 1 II 1 1 1 1 1 1 1 

121 YEKVKKSPHSI YMLFNTSELREAVPEPVLLSRAELRLLRLKLKAEQHVELYQKYSNDSWR 180 
181 YLSNRLLAPS DS P EWLS FDVTG WRQWLTRREA I EGFRLSAHCS CDS KDNTLH VE I NGFN 24 0 

1 1 1 1 1 1 1 1 1 M = 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : Ml I MMMMIMM hlllh 

181 YLSNRLLAPSDTPEWLSFDVTGWRQWLSHGGEVEGFRLSAHCSCDSKDNTLQVDINGFS 24 0 



Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 3 00 

I IMIMI MUM lllllllll IMIMM II Mill 

Db 241 SSRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRQRR ALDTN 283 

Qy 301 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

1 M M M I MM M MMM I II M M I MM M I M MIM Ml MMMMM III M MM 

Db 2 84 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 

Qy 361 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 407 

MMMMMMMMMMI MMMM Mnl I hMM 

Db 3 44 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 3 90 



RESULT 6 
TGFl_HORSE 

ID TGFl_HORSE STANDARD; PRT; 3 90 AA . 

AC 019011; 

DT 15-JUL-1998 (Rel . 36, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Equus caballus (Horse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata,- Euteleostomi ; 

OC Mammalia; Eutheria; Perissodactyla ; Equidae; Equus. 

OX NCBI_TaxID=9796 ; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Lymph node; 



RX MEDLINE=98185507; PubMed=9524 8 1 9 ; 

RA Penha-Goncalves M.N. , Onions D.E., Nicolson L. ; 

RT "Cloning and sequencing of equine transforming growth factor-beta 1 

RT (TGF beta-1) cDNA . " ; 

RL DNA Seq. 7:375-378(1997). 

CC -!- FUNCTION: TGF -BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES . MANY CELLS SYNTHESIZE TGF -BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF -BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS . 

CC -1- SUBUNIT: Homodimer; disul fide- linked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

cc 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; X99438; CAA67801.1; 

DR HSSP; P01137; 1KLA . 

DR InterPro; I PRO 024 00; GF_cysknot . 

DR InterPro; I PRO 03911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 



DR 


PRINTS; 


PR00438; 


GFCYSKNOT. 




DR 


PRINTS; 


PR01423; 


TGFB ETA . 




DR 


ProDom ; 


PD000357; 


TGFb; 1. 




DR 


SMART; SM00204; TGFB; 1. 




DR 


PROSITE; 


PS00250; 


TGF BETA 1; 


1. 


KW 


Growth factor; Mitogen; Glycoprotein; Signal. 


FT 


SIGNAL 


1 


23 


POTENTIAL . 


FT 


PROPEP 


24 


278 


BY SIMILARITY. 


FT 


CHAIN 


279 


390 


TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


285 


294 


BY SIMILARITY. 


FT 


DISULFID 


293 


356 


BY SIMILARITY. 


FT 


DISULFID 


322 


387 


BY SIMILARITY. 


FT 


DISULFID 


326 


389 


BY SIMILARITY. 


FT 


DISULFID 


355 


355 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


SQ 


SEQUENCE 


390 AA; 43974 MW; 


A86D715F44549691 CRC64 ; 



Query Match 85.7%; Score 1859.5; DB 1; Length 390; 

Best Local Similarity 87.0%; Pred . No. 1.3e-141; 

Matches 354; Conservative 12; Mismatches 24; Indels 17; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 6 0 

I 1 1 II II II M 1 1 1 1 1 ill i 1 1 'II I IIMI i 1 1 : 1 1 ! I I'l 1 1 1 1 1 1 1 'I I II I , I h 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 



Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


284 


Qy 


361 


Db 


344 



I I I I M I I I I 1 I I I I II I II I I I :||||| I I I I I I' I i I I I I II I I I I M 



YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

I : hi lllllll I h I- I I I ; I ' I I I II. I I I I I I I I I I I I ! I : I I I 

YKTVETGSHSIYMFFNTSELRAAVPDPMLLSRAELRLLRLKLSVEQHVELYQKYSNNSWR 18 0 

YLSNRLLAP S DS P EWLS FDVTG WRQWLTRRE1AI EGFRLSAHCS CDS KDNTLHVE I NGFN 24 0 

lllllll ' I I I M h: : M I I I III ! I I I I I I I I I | lllh 

YLSNRLLTPSDSPEWLS FDVTG WRQWLSQGGAMEGFRLSAHCS CDS KDNTLRVGINGFS 24 0 

SGRRGDIATI HGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 300 

I lllllll ' I Ml JMI IMIIIII Mill 

SSRRGDLATIDGMNRPFLLLMATPLERAQQLHSSRHRR ALDTN 28 3 

YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 r r i r 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 

YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 



I f t I I M I I I I I I I I I II I II I I I I ' I I II I I , I I I ; Ml I I 



RESULT 7 
TGF1J40USE 

ID TGF1_M0USE STANDARD; PRT; 390 AA. 

AC P04202; 

DT 20-MAR-1987 (Rel . 04, Created) 

DT 20-MAR-1987 (Rel. 04, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=86168129; PubMed=3007454 ; 

RA Derynck R., Jarrett J. A. , Chen E.Y., Goeddel D.V.; 

RT "The murine transforming growth factor-beta precursor."; 

RL J. Biol. Chem. 261:4377-4379(1986). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; 

RX MEDLINE=96096545; PubMed=8522200 ; 

RA Guron C, Sudarshan C. , Raghow R.; 

RT "Molecular organization of the gene encoding murine transforming 

RT growth factor beta 1 . " ; 

RL Gene 165:325-326(1995). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAlN=C57BL/6, and NOD/LT; TISSUE=Spleen ; 

RA Poirot L. , Benoist C. , Mathis D. ; 



RT "Transforming growth factor-beta 1 sequence and expression: no 

RT difference between NOD/Lt and C57B1/6 mouse strains."; 

RL Submitted (AUG- 1 998 ) to the EMBL/ GenBank/DDBJ databases. 

CC -1- FUNCTION : TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS . 

CC -!- SUBUNIT: Homodimer; disul f ide- 1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -1- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; M13177; AAA40423.1; 

DR EMBL ; L42462; AAB00138.1; -. 

DR EMBL; L42456; AAB00138.1; JOINED. 

DR EMBL; L42457; AAB00138.1; JOINED. 

DR EMBL; L42458; AAB00138.1; JOINED. 

DR EMBL; L42459; AAB00138.1; JOINED. 

DR EMBL; L42460; AAB00138.1; JOINED. 

DR EMBL; L42461; AAB00138.1; JOINED. 

DR EMBL; AJ009862; CAA08900.1; -. 

DR PIR; A01396; WFMS2 . 

DR HSSP; P01137; 1KLA. 

DR MGD; MGI: 98725; Tgfbl . 

DR GO; GO:0005578; C : extracel lular matrix; IDA. 

DR GO; GO:0006954; P : inflammatory response ; IMP. 

DR GO; GO: 0007515; P: lymph gland development; IMP. 

DR GO; GO: 0008220; P:necrosis; IMP. 

DR GO; GO: 0016202; P: regulation of myogenesis; IDA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR003 911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb__propept ide ; 1. 

DR PRINTS; PRO 04 3 8; GFCYSKNOT . 

DR PRINTS; PR01423; TGFBETA . 

DR ProDom; PD0 003 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF__BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT SIGNAL 1 23 POTENTIAL. 

FT PROPEP 24 278 

FT CHAIN 279 390 TRANSFORMING GROWTH FACTOR BETA 1. 

FT DISULFID 285 294 BY SIMILARITY. 

FT DISULFID 293 356 BY SIMILARITY. 

FT DISULFID 322 387 BY SIMILARITY. 

FT DISULFID 326 389 BY SIMILARITY. 



FT 


DISULFID 


355 


355 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 


N- LINKED (GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N-LINKED (GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED ( GLCNAC . . .) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


390 AA; 


44310 MW; 


4381A51B711D689E CRC64 ; 


Query Match 




85.5%; 


Score 1855.5; DB 1; Length 390; 



Best Local Similarity 85.5%; Pred. No. 2.8e-141; 

Matches 348; Conservative 15; Mismatches 27; Indels 17; Gaps 1; 
Qy 1 MAP SGLRLLPLLLPLLWLLVLT PGR PAAGLSTCKT I DMELVKRKRI EAI RGQI LSKLRLA 60 

I MIIIIIIIMII MM II I M II 1 1 1 II I II I Ml I 1 1 1 M 1 1 1 II 1 1 1 II MM 

Db 1 MP PSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKT I DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

II. 1 1 1 h I M 1 1 1 ; II , II I II I ! I II 1 1 M I U 1 1 , 1 1 1 1 1 1 1 1 II 1 1 1 1 1: II 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

hi I Ihll MlhMIIMII .Mil 1 Ml , II I II I ,'l I! !h II I 

Db 121 YEKTKDISHSI YMFFNTSDIREAVPEPPLLSRAELRLQRLKSSVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 240 

II I I I I hhllllllllllllllll : : hill Mlllllllll lllllll 

Db 181 YLGNRLLTPTDTPEWLS FDVTG WRQWLNQGDG I QGFR FSAHCSCDS KDNKLHVE I NG I S 240 

Qy 241 SGRRGDLAT I HGMNR P F LLLMAT PLERAQHLHS S RHRRALDTN S Y P YDVPD YAS LALDTN 300 

Mill Ml M I M II Ml Ml II III 1 1 1 1 1 1 1 1 Mill 

Db 241 PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRR ALDTN 283 

Qy 3 01 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

1 1 1 1 11 II 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f I 

Db 284 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 

Qy 361 NQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 407 

I II MM I M I M M M III III II M I II M II 1 1 Ml M 1 1 M II 

Db 344 NQHNPGASASPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 3 90 



RESULT 8 
TGF1_RAT 

ID TGF1_RAT STANDARD; PRT; 3 90 AA. 

AC P17246; 

DT 01-AUG-1990 (Rel . 15, Created) 

DT 01-AUG-1990 (Rel. 15, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Crania ta; Vertebrata; Euteleostomi ,- 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCB I _TaxI D= 10116; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Sprague-Dawley; TISSUE=Heart ; 

RX MEDLINE=90272425; PubMed=234 9 108 ; 



RA Qian S.W., Kondaiah P., Roberts A.B., Sporn M.B.; 

RT "cDNA cloning by PCR of rat transforming growth factor beta-1." ; 

RL Nucleic Acids Res. 18:3059-3059(1990). 

CC -1- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulf ide-lmked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch) . 

CC 

DR EMBL; X52498; CAA36741.1; -. 

DR PIR; S10219; S10219. 

DR HSSP; P01137; 1KLA. 

DR Inter Pro; I PRO 024 00; GF_cysknot . 

DR InterPro; I PRO 03 9 11; TGF_TGFb. 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propeptide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD0 003 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS 002 50; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 



FT 


SIGNAL 


1 


23 


POTENTIAL. 


FT 


PROPEP 


24 


278 




FT 


CHAIN 


279 


390 


TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


285 


294 


BY SIMILARITY. 


FT 


DISULF ID 


293 


356 


BY SIMILARITY. 


FT 


DISULFID 


322 


387 


BY SIMILARITY. 


FT 


DISULFID 


326 


389 


BY SIMILARITY. 


FT 


DISULFID 


355 


355 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


390 AA; 


44329 


MW; 5E21108ED50D853C CRC64 ; 



Query Match 85.5%; Score 1855.5; DB 1; Length 390; 

Best Local Similarity 85.5%; Pred. No. 2.8e-141 ; 

Matches 348; Conservative 14; Mismatches 28; Indels 17; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

i iiiiiiiiiiiii 'i 1 1 1 1 1 iii mi i ii 1 1 1 1 1 1 1 'i 1 1 1 iii 1 1 1 :i i iii 1 1 ii 

Db 1 MPPSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 



Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


284 


Qy 


361 


Db 


344 



SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

IMIIhllllllMIIIIIIIIIIIIIIIII : I; I 1 1 II I II 1 1 h 1 1 !l !•• II 

SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI 120 

YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Ml I 11 = 11 lll|::|IMMI MINIMI I I I I I I I I I I I I I I h I I I 
YDKTKDITHSI YMFFNTSDIREAVPEPPLLSRAELRLQRFKSTVEQHVELYQKYSNNSWR 18 0 



II MM h I : I I M I I I I I I I I I I I I = : hill 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M I 



SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 300 

Mill Ml I I I I I I I I I I I I I II I I I I I II I I I I Mill 
PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRR ALDTN 283 

YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

MMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMMIIIIIIIIIIIIIMIIIIIM 

YCFSSTEKNCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 343 
NQHNPGASAAPGCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 4 07 

1 1 1 1 1 1 1 1 1 ^ 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 j 1 1 1 r j 

NQHNPGASASPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 9 
TGF1_CAVP0 

ID TGF1_CAVP0 STANDARD; PRT; 3 90 AA. 

AC Q9Z1Y6; Q9QZB3 ; Q9R148; 

DT 16-OCT-2001 (Rel . 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Cavia porcellus (Guinea pig) . 

OC Eukaryota; Metazoa; Chordata; Crania ta; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Hystricognathi ; Caviidae; Cavia. 

OX NCBI JTaxID=10141; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Hartley; 

RA Jeevan A., McMurray D.N., Yoshimura T. ; 

RT "Guinea pig transforming growth factor-beta in peritoneal exudates 

RT after BCG vaccination . »' ; 

RL Submitted (OCT-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE OF 265-3 82 FROM N.A. 

RX MEDLI NE= 99144670; PubMed = 1002 5 978 ; 

RA Scarozza A.M., Ramsingh A.I., Wiener V., Wicher K. ; 

RT "Spontaneous cytokine gene expression in normal guinea pig blood and 

RT tissues."; 

RL Cytokine 10:851-859(1998). 

RN [3] 

RP SEQUENCE OF 279-371 FROM N.A. 

RC STRAIN=Hartley; TISSUE=Trachea ; 

RA Morishima Y . , Uchida Y., Nomura A., Ishii Y. , Sakamoto T. , 



RA Sekizawa K. ; 

RT "Guinea-pig transforming growth factor-beta expression in injured 

RT tracheal epithelium."; 

RL Submitted (JUL-1999) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: MULTIFUNCTIONAL PEPTIDE THAT CONTROLS PROLIFERATION, 

CC DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL TYPES. MANY 

CC CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF THEM HAVE 

CC SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES A 

CC POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul fide-linked (By similarity). 

CC -\- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 



CC 












DR 


EMBL; AF191297; 


AAF02780. 


1; - 




DR 


EMBL; AF097509; 


AAC83807. 


l; - 




DR 


EMBL; AF169347; 


AAD49347 . 


1; - 




DR 


HSSP; P01137; UCLA. 






DR 


InterPro; 


IPR002400; GF_cysknot . 


DR 


InterPro; 


IPR003 


911; TGF 


TGFb. 


DR 


InterPro ; 


IPR001839; TGFb . 




DR 


InterPro; 


IPR001111; TGFb 


_N. 




DR 


Pfam; PF00019; TGF-beta; 


1 . 




DR 


Pfam; PF00688; TGFb_propept ide ; 1. 


DR 


PRINTS; PR00438; 


GFCYSKNOT . 




DR 


PRINTS; PR01423; 


TGFB ETA . 






DR 


ProDom; PD000357 


; TGFb; 1 






DR 


SMART; SM002 04; 


TGFB; 1. 






DR 


PROSITE; 


PS00250; TGF BETA 1; 


1. 


KW 


Growth factor; Mitogen; Glycoprotein; Signal. 


FT 


SIGNAL 


1 


24 




POTENTIAL. 


FT 


PROPEP 


25 


278 




POTENTIAL . 


FT 


CHAIN 


279 


390 




TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


285 


294 




BY SIMILARITY. 


FT 


DISULFID 


293 


356 




BY SIMILARITY. 


FT 


DISULFID 


322 


387 




BY SIMILARITY. 


FT 


DISULFID 


326 


389 




BY SIMILARITY. 


FT 


DISULFID 


355 


355 




INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 




N-LINKED ( GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


244 


246 




CELL ATTACHMENT SITE (POTENTIAL) . 


FT 


CONFLICT 


279 


279 




G - > P (IN REF. 3) . 


FT 


CONFLICT 


286 


286 




F -> S (IN REF. 2) . 


FT 


CONFLICT 


309 


309 




K -> E (IN REF. 2) . 


FT 


CONFLICT 


322 


322 




C -> R (IN REF. 2) . 


FT 


CONFLICT 


350 


350 




A -> G (IN REF. 2) . 


SQ 


SEQUENCE 


390 AA; 44328 


MW; 


1539F849BA0C0FF1 CRC64 ; 



Query Match 85.1%; Score 1846.5; DB 1; Length 390; 

Best Local Similarity 85.3%; Pred. No. 1.5e-140; 

Matches 347; Conservative 15; Mismatches 28; Indels 17; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I II llllllllllllllll IIMMIMI III IIMIIIIMIIIIMIII MM 

Db 1 MPPSRLRLLPLLLPLLWLLVLAPGRPASGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I MINIMI INN MM Ml Ml! MM II MMMI I II M M M II M * • ■■ I 

Db 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEPDYYAKEVTRVLMVDNSHNI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

I = Ihll I I I I J I I I i I I : : I I I I III: I I I I II I II II M I I I M I II 
Db 121 YKS I ETVAHS I YMFFNTS ELREAVPDPLLLSRAELRMQRLKLNVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDS PEWLS FDVTG WRQWLTRREA I EG FRLSAHCSCDS KDNTLHVE I NGFN 240 

Mlhll llhlllMMIIIIIIIIh: I Mill IIIIIMIMIII Mill 

Db 181 YLSNQLLTPSDTPEWLSFDVTGWRQWLSQGEELEGFRFSAHCSCDSKDNTLRVEINGIG 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 3 00 

I M M I MIMMMMMMMMMMMMM MM 

Db 241 PKRRGDLAAIHGMNRPFLLLMATPLERAQHLHSSRHRR GLDTN 283 

Qy 3 01 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 3 60 

MM MM II MM II III Ml II M III MM II MM III Mill IMMI II II II I 

Db 2 84 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 34 3 

Qy 361 NQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 407 

ill; imii! linn mi ill iii in, i . 1 1 1 ; i ' 1 1 1 1 1 1 1 1 

Db 344 NQHNPGASAAPCCVPQALEPLPI VYYVGRKAKVEQLSNMIVRSCKCS 390 



RESULT 10 
TGFl_BOVIN 

ID TGFl_BOVIN STANDARD; PRT; 315 AA. 

AC P18341; 

DT 01-NOV-1990 (Rel. 16, Created) 

DT 01-NOV-1990 (Rel. 16, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) (Fragment) . 

GN TGFB1 . 

OS Bos taurus (Bovine) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetartiodactyla ; Ruminantia; Pecora; Bovoidea; 

OC Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913; 

RN [1] 

RP SEQUENCE FROM N . A . 

RX MEDLINE=9104 2552; PubMed=3 1 534 5 9 ; 

RA van Obberghen- Schil 1 ing E . , Kondaiah P., Ludwig R.L. , Sporn M.B., 

RA Baker C. C. ; 

RT "Complementary deoxyribonucleic acid cloning of bovine transforming 

RT growth factor-beta 1 . " ; 

RL Mol . Endocrinol. 1:693-698(1987). 

RN [2] 

RP SUBUNITS. 



RC TISSUE=Bone; 

RX MEDLINE=9212 93 07; PubMed= 1733 93 6 ; 

RA Ogawa Y., Schmidt D.K., Dasch J.R., Chang R.J., Glaser C.B.; 

RT "Purification and characterization of transforming growth factor-beta 

RT 2.3 and -beta 1.2 heterodimers from bovine bone."; 

RL J. Biol. Chem. 267:2325-2328(1992). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul fide-1 inked . Heterodimers of TGF-beta 1/2 

CC have been found in bone. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; M36271; AAA30778.1; -. 

DR PIR; A40057; A40057. 

DR HSSP; P01137; 1KLA. 

DR Inter Pro; IPR0024 00; GF_cysknot . 

DR InterPro; I PRO 03 9 11; TGF_TGFb. 

DR InterPro; I PRO 01 83 9; TGFb. 

DR InterPro; I PRO 01 111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS0025 0; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein. 

FT NONJTER 1 1 

FT PROPEP <1 203 

FT CHAIN 204 315 TRANSFORMING GROWTH FACTOR BETA 1. 

FT DISULFID 210 219 BY SIMILARITY. 

FT DISULFID 218 281 BY SIMILARITY. 

FT DISULFID 247 312 BY SIMILARITY. 

FT DISULFID 251 314 BY SIMILARITY. 

FT DISULFID 280 280 INTERCHAIN (BY SIMILARITY) . 

FT CARBOHYD 7 7 N- LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 61 61 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 101 101 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT SITE 169 171 CELL ATTACHMENT SITE (POTENTIAL) . 

SQ SEQUENCE 315 AA; 36269 MW; C2 717A2 3D9 94E0 OE CRC64 ; 



Query Match 73.4%; Score 1592.5; DB 1; Length 315; 

Best Local Similarity 89.5%; Pred . No. 2.5e-120; 

Matches 297; Conservative 9; Mismatches 9; Indels 17; Gaps 1; 



Qy 76 AVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLF 135 

h I I I I I I II M I I I I I I I II I I I I I II I I 'I I I ! I I I ' : I 1 I I | : MMI | 
Db 1 AILALYNSTRDRVAGESAETEPEPEADYYAKEVTRVLMVEYGNKI YDKMKSSSHSI YMFF 60 

Qy 136 NTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEW 195 

II I I I I I I II M I I I I ImM I III I I I I II I MM I M I I M M M I I I I I I M I I I l| I 
Db 61 NTSELREAVPEPVLLSRADVRLLRLKLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEW 12 0 

Qy 196 LSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFNSGRRGDLATIHGMNR 255 

1 1 1 1 1 1 1 1 II Ml 1 1 1 1 I M I III II 1 1 II I MM I . h 1 1 1 h M 1 1 1 1 1 II I Mil I 

Db 121 LS FDVTG WRQWLTRREE I EGFRLSAHCS CDS KDNTLQVD I NGFSSGRRGDLAT I HGMNR 18 0 

Qy 2 56 PFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTNYCFSSTEKNCCVRQL 315 

1 1 1 1 1 1 1 1 II : l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M Ml I II II I 

Db 181 PFLLLMATPLERAQHLHSSRHRR ALDTNYCFSSTEKNCCVRQL 223 

Qy 316 YIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVP 375 

MMMMMMMMMMMMMMIMMMMMMMMMMMMMMMI 

Db 224 YIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVP 283 

Qy 3 76 QALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 4 07 

I M 1 1 M III II M M I II ll I II 1 1 Ml II; 

Db 284 QALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 315 



RESULT 11 
TGF1_CHICK 

ID TGF1_CHICK STANDARD; PRT; 373 AA. 

AC P09531; 

DT 01-MAR-1989 (Rel . 10, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) (TGF-beta 4) 

DE (Fragment) . 

GN TGFB1 . 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Gall if ormes ; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N . A . 

RC STRAIN=White leghorn; 

RX MEDLINE=89112198; PubMed=24 64 13 1 ; 

RA Jakowlew S.B., Dillard P.J., Sporn M.B., Roberts A.B.; 

RT "Complementary deoxyribonucleic acid cloning of a messenger 

RT ribonucleic acid encoding transforming growth factor beta 4 from 

RT chicken embryo chondrocytes."; 

RL Mol. Endocrinol. 2:1186-1195(1988). 

RN [2] 

RP REVISIONS. 

RX MEDLINE=92357039; PubMed=13 53 8 6 0 ; 

RA Burt D.W., Jakowlew S.B.; 

RT "Correction: a new interpretation of a chicken transforming growth 

RT factor-beta 4 complementary DNA." ; 

RL Mol. Endocrinol. 6:989-992(1992). 



CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE . TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul fide-1 inked . 

CC -'. - SUBCELLULAR LOCATION : Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Biomf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; M31160; AAB05637.1; -. 

DR PIR; A41918; A41918 . 

DR HSSP; P01137; 1KLA. 

DR Inter Pro; IPR003 911; TGF_TGFb. 

DR InterPro; IPR00183 9; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 



DR 


PRINTS; 


PR01423; 


TGFB ETA . 




DR 


ProDom; 


PD000357; 


TGFb; 1. 




DR 


SMART; SM002 04; TGFB; 1. 




DR 


PROSITE; 


PS00250; 


TGF BETA 1; 


1. 


KW 


Growth factor; Mitogen; Glycoprotein; Signal. 


FT 


NON TER 


1 


1 




FT 


SIGNAL 


<1 


1 


POTENTIAL. 


FT 


PROPEP 


2 


259 


POTENTIAL. 


FT 


CHAIN 


260 


373 


TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


266 


277 


BY SIMILARITY. 


FT 


DISULFID 


276 


339 


BY SIMILARITY. 


FT 


DISULFID 


305 


370 


BY SIMILARITY. 


FT 


DISULFID 


309 


372 


BY SIMILARITY. 


FT 


DISULFID 


338 


338 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


54 


54 


N- LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


109 


109 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


153 


153 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


224 


226 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


373 AA 


; 42634 MW; 


9903F3479C8552E5 CRC64 ; 



Query Match 49.2%; Score 1067; DB 1; Length 373; 

Best Local Similarity 53.5%; Pred. No. 4e-78; 

Matches 209; Conservative 52; Mismatches 98; Indels 32; Gaps 8; 

Qy 30 LSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVA 89 

II h :| = | ' : I I I I I : , I I I I I M I I :|| : || |lh I II II II - 
Db 2 LSTCQRLDLEAAKKKRIEAVRGQILSKLRLTAPPPASETPPRPLPDDVRALYNSTQELLK 61 

Qy 90 GES - VEPEPEPEADYYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPV 14 8 

: : | |: :|:|||: |: | : : : := ||:: :|| I I I 
Db 62 QRARLRPPPDGPDEYWAKELRRI PMETTWDGAMEHWQPQSHSIFFVFNVSRARRG-GRPT 120 



Qy 14 9 LLSRAELRLLRLKLK VEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGV 203 

II MINI I || :|||| | | HIM I : : I I I I I I I I 

Db 121 LLHRAELRMLRQKAAADSAGTEQRLELYQGYGNASWRYLHGRSVRATADDEWLSFDVTDA 180 

Qy 204 VRQWLTRREAI EGFRLSAHCSCD S KDNTLH VE I NGFNSGRRGDLAT I HGMNR - - PFL 258 

I III: I = 1 = 11 llh : = I II : I I h = I :| h = 

Db 181 VHQWLSGSELLGVFKLSVHCPCEMGPGHAEEMRISIEGFEQ-QRGDMQSIAKKHRRVPYV 239 

Qy 259 LLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTNYCF- -SSTEKNCCVRQLY 316 

I II I III IN || I I I : I I I : Illllll II 

Db 24 0 LAMALPAERANELHSARRRR DLDTDYCFGPGTDEKNCCVRPLY 282 

Qy 317 IDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQ 376 

Illllll lllillllll lllhllllllll lllhllMIIIIIMIIIIMIIIM 

Db 283 IDFRKDLQWKWIHEPKGYMANFCMGPCPYIWSADTQYTKVLALYNQHNPGASAAPCCVPQ 342 

Qy 377 ALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 4 07 

hlllhlllll = I I I I I I I ^ I I = I I I I 

Db 343 TLDPLPI I YYVGRNVRVEQLSNMWRACKCS 373 

RESULT 12 
TGF1_XENLA 

ID TGF1_XENLA STANDARD; PRT; 382 AA. 

AC P16176; 

DT 01-APR-1990 (Rel . 14, Created) 

DT 01-APR-1990 (Rel. 14, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) (TGF-beta 5) . 

OS Xenopus laevis (African clawed frog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Amphibia; Batrachia; Anura; Mesobatrachia ; Pipoidea; Pipidae; 

OC Xenopodinae; Xenopus. 

OX N CB I JTax I D = 8 3 5 5 ; 

RN [1] 

RP SEQUENCE FROM N . A . 

RX MEDLINE=9 0 110090; PubMed=229560 1 ; 

RA Kondaiah P., Sands M.J., Smith J.M. , Fields A., Roberts A.B., 

RA Sporn M.B., Melton D.A.; 

RT "Identification of a novel transforming growth factor-beta (TGF-beta 

RT 5) mRNA in Xenopus laevis."; 

RL J. Biol. Chem. 265:1089-1093(1990). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Vempati U.D., Kondaiah P.; 

RL Submitted (AUG- 1997) to the EMBL/ GenBank/DDBJ databases. 

CC -!- FUNCTION: IMPORTANT ROLE IN CERTAIN ASPECTS OF DIFFERENTIATION. 

CC -!- SUBUNIT: Homodimer; disul fide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinformat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 



cc 
cc 
cc 

DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 



entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib.ch) . 



EMBL; J05180; AAA49968.1; - 

EMBL ; AF009335; AAB64441.1; 

EMBL; AF009331; AAB64441. 

EMBL; AF009332; AAB64441. 

EMBL; AF009333; AAB64441. 

EMBL; AF009334; AAB64441. 



1; 

1; 

1; 
1; 



JOINED. 
JOINED. 
JOINED. 
JOINED. 



AF009332; 
AF009333; 
AF009334; 
PIR; A34929; B61036. 
HSSP; P01137; 1KLA. 
InterPro; IPR002400; GF_cysknot . 
Inter Pro; I PRO 03 911; TGF_TGFb . 
InterPro; IPR001839; TGFb. 
InterPro ; IPR001111; TGFb__N . 
Pfam; PF00019; TGF-beta; 1. 
Pfam; PF00688; TGFb_propept ide ; 1. 
PRINTS; PR00438; GFCYSKNOT . 
PRINTS; PRO 1423; TGFB ETA . 
ProDom; PD0 00357; TGFb; 1. 
SMART; SM00204; TGFB; 1. 
PROSITE; PS0025 0; TGF_BETA_1 ; 1. 
Growth factor; Mitogen; Glycoprotein; Signal. 

POTENTIAL . 



FT 


SIGNAL 


1 


21 


FT 


PROPEP 


22 


270 


FT 


CHAIN 


271 


382 


FT 


DISULFID 


277 


286 


FT 


DISULFID 


285 


348 


FT 


DISULFID 


314 


379 


FT 


DISULFID 


318 


381 


FT 


DISULFID 


347 


347 


FT 


CARBOHYD 


73 


73 


FT 


CARBOHYD 


123 


123 


FT 


CARBOHYD 


166 


166 


FT 


SITE 


234 


236 


SQ 


SEQUENCE 


382 AA; 


442 


Query Match 




40 



TRANSFORMING GROWTH FACTOR BETA 1 . 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
INTERCHAIN (BY SIMILARITY) . 
N-LINKED (GLCNAC. . .) (POTENTIAL). 
N-LINKED (GLCNAC. . .) (POTENTIAL). 
N-LINKED (GLCNAC. . .) (POTENTIAL). 
CELL ATTACHMENT SITE (POTENTIAL) . 
; 1034621C917AAE15 CRC64 ; 



Best Local Similarity 46.4%; Pred. No. le-63; 

Matches 192; Conservative 54; Mismatches 121; Indels 47; Gaps 



11; 



Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 



9 LPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDV 68 

: M II I I I I I Mill I |:::||IMI|||||||:| I I 

1 MEVLWMLLVXLVXHLSSLAMSLSTCKAVDMEEVRKRRI EAI RGQI LSKLKLDKTPDV- DS 5 9 

69 PPGPLPEAVLALYNSTRDRVAGESVEPE PEPEADYYAKEVTRVLMVESGNQI YDK 123 

: I = Mill = = == I : II M M I I II == I 

6 0 EKMTVPSEAI FLYNSTLEVI REKATREEEHVGHDQN I QDYYAKQVYR - - - FES I TELED - 115 

124 FKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKL- -KVEQHVELYQKYSNDSW-- 179 

I M I Ml I II lllh : : ::| : || : || : 

116 HEFKFKFNASHVRENVGMNSLLHHAELRMYKKQTDKNMDQRMELFWKYQENGTTH 17 0 

180 - RYLSNRLLAPSDS PEWLS FDVTG WRQWLTRREAI EGFRLSAHCSC DSKDNTLHVE 235 

III : I IIMIMI I Ml I I I II II Ml 
171 SRYLESKYITPVTDDEWMSFDVTKTVNEWLKRAEENEQFGLQPACKCPTPQAKD ID 226 



Qy 236 INGFNSGRRGDIATIHGM--NRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYA 293 

I II Mill:: :|:|:: : | || : ||l =1 
Db 227 I EGF- PALRGDLASLSSKENTKPYLMITSMPAERI DTVTSSRKKR 270 

Qy 294 SLALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQY 353 

= III : 1 1 1 h 1 1 hi I III I II 1 1 II I III Ihlll III I Ih MM 

Db 271 --GVGQEYCFGNNGPNCCVKPLYINFRKDLGW?CWIHEPKGYEANYCLGNCPYIWSMDTQY 328 

Qy 3 54 SKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 4 07 

Nihil Ihlll II -.Mill II Mlhll I II MM III hi I II M 

Db 329 SKVLSLYNQNNPGASISPCCVPDVTiEPLPI I YYVGRTAKVEQLSNMWRSCNCS 382 



RESULT 13 
TGF3_CHICK 

ID TGF3_CHICK STANDARD; PRT; 412 AA . 

AC P16047; 

DT 01-APR-1990 (Rel . 14, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 3 precursor (TGF-beta 3) . 

GN TGFB3 . 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves ; Neognathae; Gall iformes ; Phasianidae; Phasianinae; 

OC Gallus . 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N . A . 

RX MEDLINE=89096966; PubMed=321 115 8 ; 

RA Jakowlew S.B., Dillard P.J., Kondaiah P., Sporn M.B., Roberts A.B.; 

RT "Complementary deoxyribonucleic acid cloning of a novel transforming 

RT growth factor-beta messenger ribonucleic acid from chick embryo 

RT chondrocytes . n ; 

RL Mol . Endocrinol. 2:747-755(1988). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=White leghorn; 

RX MEDLINE=95169270; PubMed=786512 9 ; 

RA BurtD.W., DeyB.R., Paton I.R., Morrice D.R., Law A.S.; 

RT "The chicken transforming growth factor-beta 3 gene: genomic 

RT structure, transcriptional analysis, and chromosomal location."; 

RL DNA Cell Biol. 14:111-123(1995). 

RN [3] 

RP SEQUENCE OF 1-117 FROM N.A. 

RC STRAIN=White leghorn; TISSUE=Blood; 

RX MEDLINE=92134496; PubMed=1840616 ; 

RA Burt D.W., Dey B.R., Paton I.R.; 

RT "Comparative analysis of human and chicken transforming growth 

RT factor-beta 2 and -beta 3 promoters."; 

RL J. Mol. Endocrinol. 7:175-183(1991). 

RN [4] 

RP SEQUENCE OF 1-117 FROM N.A. 

RX MEDLINE-93024487; PubMed=1406706 ; 

RA Jakowlew S.B., Lechleider R. , Geiser A.G., Kim S.J., 

RA Santa-Coloma T.A., Cubert J., Sporn M.B., Roberts A.B.; 

RT "Identification and characterization of the chicken transforming 



RT growth factor-beta 3 promoter."; 

RL Mol. Endocrinol. 6:1285-12 98(1992). 

CC -!- FUNCTION: INVOLVED IN EMBRYOGENES I S AND CELL DIFFERENTIATION. 

CC -!- SUBUNIT: Homodimer; disul f ide- 1 inked . 

CC -!- SUBCELLULAR LOCATION : Secreted . 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; M31154; AAA49089.1; 

DR EMBL; X58127; CAA41128.2; -. 

DR EMBL; X60055; CAA41128.2; JOINED. 

DR EMBL; X60091; CAA41128.2; JOINED. 

DR EMBL; X60090; CAA41128.2; JOINED. 

DR EMBL; S46000; AAB23575.1; -. 

DR PIR; A34939; A34939. 

DR HSSP; PI 0600; 1TGJ . 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR003911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 



DR 


PRINTS; 


PR00438; 


GFCYSKNOT. 




DR 


PRINTS ; 


PR01423; 


TGFB ETA . 




DR 


ProDom; 


PD000357; 


TGFb; 1. 




DR 


SMART; SM002 04; TGFB; 1. 




DR 


PROSITE; 


PS00250; 


TGF BETA 1; 


1 . 


KW 


Growth factor; Mitogen; Glycoprotein; Signal. 


FT 


SIGNAL 


1 


23 


POTENTIAL. 


FT 


PROPEP 


24 


300 




FT 


CHAIN 


301 


412 


TRANSFORMING GROWTH FACTOR BETA 3 . 


FT 


DISULFID 


307 


316 


BY SIMILARITY. 


FT 


DISULFID 


315 


378 


BY SIMILARITY. 


FT 


DISULFID 


344 


409 


BY SIMILARITY. 


FT 


DISULFID 


348 


411 


BY SIMILARITY. 


FT 


DISULFID 


377 


377 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


74 


74 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


135 


135 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


142 


142 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


261 


263 


CELL ATTACHMENT SITE (POTENTIAL) . 


FT 


CONFLICT 


323 


326 


ELPT -> DFRQ (IN REF . 1) . 


SQ 


SEQUENCE 


412 AA 


; 47077 MW; 


1CAB883170069D55 CRC64 ; 



Query Match 4 0.1%; Score 869.5; DB 1; Length 412; 

Best Local Similarity 45.5%; Pred. No. 3.1e-62; 

Matches 195; Conservative 56; Mismatches 117; Indels 61; Gaps 13; 

Qy 15 LLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLASPPSQGDVPPGPLP 74 

1 = I =1= : Ihl h I = I :hl|:|llllllllllll III I I \ 

Db 9 LVLLSLLS FATVSLALS S CTTLDLEH I KKKR VEA I RGQ I LS KLRLTS PPE - - SVG PAHVP 66 



Qy 75 EAVLALYNSTRDRVAGESVEPEPE PEADYYAKEVTRVLMVE SGNQIYDKF 124 

Mllllllh : I U I I h'llllh = M = 1 = = 

Db 67 YQILALYNSTRELL- -EEMEEEKEESCSQENTESEYYAKEI HKFDMIQGLPEHNELGICP 124 

Qy 125 KGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND 177 

II - II I I I III 1-11= II :||:| I 

Db 125 KGVTSNVFR - FNVS SAEKNSTNLFRAEFRVLRVPNPSSKRSEQR I ELFQ I LRPDEHI 180 

Qy 178 -SWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHCSC D 226 

I I I I I : llllllllll Ihll Ih I I'M 

Db 181 AKQRYLSGRNVQTRGSPEWLSFDVTDTVREWLLHRESNLGLE I S I HCPCHTFQPNGDI LE 24 0 

Qy 227 SKDNTLHVEINGFNSG RRGDLATI HGMNRPFLLLMATPLERAQH- -LHSSRHRR 278 

I :: I =1 I I I I : := I hll I I = I I =| 

Db 241 NLHEVLE I KFKG I DSEDDYGRGDLGRLKKQKDLHNPHLI LMMLPPHRLES PTLGGQRKKR 300 

Qy 27 9 ALDTNSYPYDVPDYASLALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANF 33 8 

I M II I II = hlllil llh MlllhllllM I I I 
Db 3 01 ALDTNYCFRNLEENCCVR PLY I ELPTDLGWKWVHEPKGYFANF 34 3 

Qy 339 CLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSN 398 

I M Ih I I I :| || Ml II llhllllll MM hlllil I I II I II I 
Db 344 CSGPCPYLRSADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSN 403 

Qy 399 MIVRSCKCS 407 

hhlllll 

Db 4 04 MWKSCKCS 412 

RESULT 14 
TGF3_PIG 

ID TGF3_PIG STANDARD; PRT; 409 AA . 

AC P15203; 

DT 01-APR-1990 (Rel . 14, Created) 

DT 01-APR-1990 (Rel. 14, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 3 precursor (TGF-beta 3) . 

GN TGFB3 . 

OS Sus scrofa (Pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetartiodactyla,- Suina; Suidae; Sus. 

OX NCBI_TaxID=9823; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Ovary; 

RX MEDLINE=8 909112 0; PubMed=32 08 74 6 ; 

RA Derynck R., Lindquist B., Lee A., Wen D. , Tamm J., Graycar J.L., 

RA Rhee L. , Mason A.J., Miller D.A. , Coffey R.J., Moses H.L., 

RA Chen E . Y . ; 

RT "A new type of transforming growth factor-beta, TGF-beta 3."; 

RL EMBO J. 7:3737-3743(1988). 

CC -!- FUNCTION: INVOLVED IN EMBRYOGENESI S AND CELL DIFFERENTIATION. 

CC -!- SUBUNIT: Homodimer; disul fide-1 inked . 

CC -!» SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 



CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch) . 

CC 

DR EMBL; X14150; CAA32363.1; -. 

DR PIR; S01825; S01825. 

DR HSSP; P10600; 1TGJ. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR003 911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS ; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT SIGNAL 1 18 POTENTIAL. 

FT PROPEP 19 297 

FT CHAIN 298 409 TRANSFORMING GROWTH FACTOR BETA 3. 

FT DISULFID 304 313 BY SIMILARITY. 

FT DISULFID 312 375 BY SIMILARITY. 

FT DISULFID 341 406 BY SIMILARITY. 

FT DISULFID 345 408 BY SIMILARITY. 

FT DISULFID 374 374 INTERCHAIN (BY SIMILARITY) . 

FT CARBOHYD 72 72 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 133 133 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 140 140 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT SITE 259 261 CELL ATTACHMENT SITE (POTENTIAL) . 

SQ SEQUENCE 409 AA; 46814 MW; B4 900235B5CC955E CRC64 ; 

Query Match 39.2%; Score 851; DB 1; Length 409; 

Best Local Similarity 44.6%; Pred. No. 9.3e-61; 

Matches 190; Conservative 58; Mismatches 122; Indels 56; Gaps 13; 

Qy 15 LLWLLVLTPGR PAAGLSTCKT I DMELVKRKR I EA I RGQ I LS KLRLAS PPSQGDVP PG PLP 74 

h I :| : hi : I II I : I I I I I I I M I I I I III = * I 

Db 7 L WLALLNFAT VSLSMSTCTTLDFDH I KRKRVEAI RGQ I LS KLRLTS P PDPSML - -AN I P 64 

Qy 75 EAVLALYNSTR DRVAGESVE- - PEPEPEADYYAKEVTRVLMV ESGNQI YDKFKG 126 

II MINI : I II = = l-llllh I = I I = II 

Db 65 TQVLDLYNSTRELLEEVHGERGDDCTQENTESEYYAKEI YKFDMI QGLEEHNDLAVCPKG 124 

Qy 127 TPHS LYMLFNTS ELREAVPE P VLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 

:: || | : : I III h II : : || =11 = 1 I 

Db 12 5 I TSKI FR - FNVSSVEK NETNLFRAEFRVLRMPNPSSKRSEQRI ELFQI LQPDEHI AK 18 0 

Qy 179 WRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSC DSK 228 

II-- = I -.11111111 I I'- I I III: I HIM 

Db 181 QRYI DGKNLPTRGAAEWLSFDVTDTVREWLLRRESNLGLEI S I HCPCHTFQPNGDI LENI 24 0 



Qy 229 DNTLHVEINGFNS GRRGDLATI HGM- -NRPFLLLMATPLERAQH- -LHSSRHRRALD 281 

: :: I • I I I I I : : I hll I • I : I : I =| 
Db 241 QEVMEI KFKGVDSEDDPGRGDLGRLKKKKEHS PHLI LMMI PPDRLDNPGLGAQRKKR 2 97 

Qy 282 TNSYPYDVPDYASIALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLG 341 

Illlllll : hlllll I I I I I I ^ I I I ' I ^ I 1 I I i I : I i M I 
Db 298 ALDTNYCFRNLEENCCVRPLY I DFRQDLGWKWVHEPKGYYANFCSG 343 

Qy 342 PCPY I WSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI V 401 

MM: I II --I II Ml M 1 1 h 1 1 M II MM I - II I II MM I II Ml 

Db 344 PCPYLRSADTTHSSVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTAKVEQLSNMVV 4 03 

Qy 402 RSCKCS 407 

MIMI 

Db 4 04 KSCKCS 4 09 



RESULT 15 
TGF3_MOUSE 

ID TGF3_MOUSE STANDARD; PRT; 410 AA. 

AC P17125; 

DT 01-AUG-1990 (Rel . 15, Created) 

DT 01-AUG-1990 (Rel. 15, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 3 precursor (TGF-beta 3) . 

GN TGFB3 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCB I_Tax I D= 1 0 0 9 0 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=9 01 90650; PubMed=2 628 73 0 ; 

RA Miller D.A., Lee A., Matsui Y. , Chen E.Y., Moses H.L., Derynck R.; 

RT "Complementary DNA cloning of the murine transforming growth 

RT factor-beta 3 (TGF beta 3) precursor and the comparative expression 

RT of TGF beta 3 and TGF beta 1 messenger RNA in murine embryos and 

RT adult tissues. "; 

RL Mol. Endocrinol. 3:1926-1934(1989). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=91000714; PubMed=22 06556 ; 

RA Denhez F. , Lafyatis R . , Kondaiah P., Roberts A.B., Sporn M.B.; 

RT "Cloning by polymerase chain reaction of a new mouse TGF-beta, 

RT mTGF-beta 3 . " ; 

RL Growth Factors 3:13 9-146(1990). 

CC -!- FUNCTION: INVOLVED IN EMBR YOGENES I S AND CELL DIFFERENTIATION. 

CC -!- SUBUNIT: Homodimer; disulfide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 



CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; M32745; AAA40422.1; -. 

DR PIR; A41397; A41397 . 

DR HSSP; PI 0600; 1TGJ . 

DR MGD; MGI: 98727; Tgfb3 . 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; IPR003911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA__1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal 



FT 


SIGNAL 


1 


23 


POTENTIAL. 


FT 


PROPEP 


24 


298 


POTENTIAL. 


FT 


CHAIN 


299 


410 


TRANSFORMING GROWTH FACTOR BETA 3. 


FT 


DISULFID 


305 


314 


BY SIMILARITY. 


FT 


DISULFID 


313 


376 


BY SIMILARITY. 


FT 


DISULFID 


342 


407 


BY SIMILARITY. 


FT 


DISULFID 


346 


409 


BY SIMILARITY. 


FT 


DISULFID 


375 


375 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


72 


72 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


133 


133 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


140 


140 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


259 


261 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


410 AA; 


46884 


MW; 250F7048CA432BD6 CRC64 ; 


Query Match 




38 . 9 s 


k; Score 844.5; DB 1; Length 410; 


Best Local Similarity 


44 .2 s 


k ; Pred. No. 3.1e-60; 


Matches 188; 


Conservative 


57; Mismatches 127; Indels 53; 



Gaps 12, 

Qy 15 LLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLP 74 

h I :| : I I I I hi :|:||:||||||||||||| III : 

Db 7 L WLALLNLATI SLSLSTCTTLDFGH I KKKRVEA I RGQ I LS KLRLTS P PE PS VMT - - HVP 64 

Qy 75 EAVLALYNSTR DRVAGESVE- - PEPEPEADYYAKEVTRVLMVE SGNQI YDKFKG 126 

MINIMI = = M I = hMMIh ■ l = = l = = II 

Db 65 YQVLALYNSTRELLEEMHGEREEGCTQETSESEYYAKEIHKFDMIQGLAEHNELAVCPKG 124 

Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 

:: || I : : MM h I h : M MIM I 

Db 125 ITSKVFR-FNVSSVEK NGTNLFRAEFRVLRVPNPSSKRTEQRI ELFQILRPDEHIAK 180 

Qy 17 9 WRYLSNRLLAPSDS PEWLSFDVTGWRQWLTRREAI EGFRLSAHCSC DSK 22 8 

lh = I = MMMM Ihll Mh I Mill 

Db 181 QRY I GGKNLPTRGTAEWLSFDVTDTVREWLLRRESNLGLEI S I HCPCHTFQPNGDI LENV 24 0 

Qy 229 DNTLHVE I NGFNS GRRGDLAT I HGM NRPFLLLMATPLERAQHLHS SRHRRALDT 282 

: - I == MM = = I hll I I h 

Db 241 HE VME I KFKG VDNEDDHGRGDLGRLKKQKDHHN PHLI LMMI P PHRLDS PGQGSQRK 296 



Qy 



283 NSYPYDVPDYASLALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGP 342 



MINIM : h I I I I I I ! I I < h I ^ I I I I I ! : I I I I | I 

Db 297 KKALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGP 345 

Qy 343 CPYIWSLDTQYSKVIALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVR 402 

MM I II : I II Ml M MMIIIMI MM hlllll IIIIIIMhh 

Db 346 CPYLRSADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWK 405 

Qy 4 03 SCKCS 4 07 

Mill 

Db 4 06 SCKCS 410 



Search completed: October 28, 2003, 09:08:42 
Job time : 9.15764 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



October 28, 2003, 07:50:55 ; Search time 32.6306 Seconds 

(without alignments) 
3218.683 Million cell updates/sec 



Title: 
Perf ec 
Sequence : 

Scoring table 



US-10-017-372E-39 
Perfect score: 2169 

1 MAPSGLRLLPLLLPLLWLLV GRKPKVEQLSNMI VRSCKCS 4 07 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched : 



830525 seqs, 258052604 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



830525 



Database 



SPTREMBL 23 : * 



1 

2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 



sp_archea : * 
sp_bacteria : * 
sp_f ungi : * 
sp_human : * 
sp_invertebrate : * 
sp_mammal : * 
sp_mhc : * 
sp^organelle: * 
sp_phage : * 

sp__plant : * 

sp_rodent : * 

sp__virus : * 

sp__vertebrate : * 



14 : sp_unclassif ied : * 

15: sp_rvirus : * 

16 : sp_bacteriap : * 

17: sp_ar cheap : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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24 


397 


18 


3 


88 


13 


Q90ZE7 


Q90ze7 acipenser b 


25 


393 


18 


1 


87 


13 


042306 


042306 carassius a 


26 


388 .5 


17 


9 


309 


4 


Q8WV88 


Q8wv88 homo sapien 


27 


383 


17 


7 


91 


6 


Q9MYZ1 


Q9myzl capra hircu 


28 


373 


17 


2 


86 


6 


Q28241 


Q28241 cervus elap 


29 


358 


16 


5 


81 


6 


Q9N1S3 


Q9nls3 capreolus c 


30 


340 


15 


7 


87 


13 


Q8JHB6 


Q8 jhb6 scophthalmu 


31 


336.5 


15 


5 


375 


13 


Q8UWD8 


Q8uwd8 columba liv 


32 


323 .5 


14 


9 


375 


13 


Q8AVB2 


Q8avb2 coturnix co 


33 


321.5 


14 


8 


375 


13 


Q8UWD7 


Q8uwd7 coturnix ch 


34 


321.5 


14 


8 


389 


13 


Q90YY0 


Q90yy0 ictalurus p 


35 


317 


14 


6 


77 


13 


Q90YF8 


Q90yf8 oncorhynchu 


36 


316 . 5 


14 


6 


375 


13 


Q98SP0 


Q98sp0 gallus gall 


37 


313 . 5 


14 . 


5 


375 


6 


Q9GM97 


Q9gm97 equus cabal 


38 


313 . 5 


14 . 


5 


375 


13 


Q8UWE0 


Q8uwe0 anas platyr 


39 


311.5 


14 


4 


375 


13 


Q8UWD9 


Q8uwd9 anser anser 


40 


310. 5 


14 


3 


375 


6 


Q8HY52 


Q8hy52 lepus capen 


41 


307.5 


14 . 


2 


375 


6 


Q95J86 


Q95j86 macaca fasc 


42 


302 


13 . 


9 


50 


6 


Q28240 


Q28240 cervus elap 


43 


301 


13 . 


9 


62 


13 


Q90ZJ7 


Q90zj7 anguilla an 



44 300 13.8 62 13 Q90YF4 Q90yf4 pleuronecte 

45 296.5 13.7 375 6 Q8WNS6 Q8wns6 bos taurus 



ALIGNMENTS 



RESULT 1 
Q9TUM8 

ID Q9TUM8 PRELIMINARY; PRT; 390 AA . 

AC Q9TUM8 ; 

DT 01-MAY-2000 (TrEMBLrel . 13, Created) 

DT O1-MAY-2O00 (TrEMBLrel. 13, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 1. 

GN TGFB1 . 

OS Equus caballus (Horse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Perissodactyla ; Equidae; Equus. 

OX NCBI_TaxID=9796; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Nixon A.J., Brower-Toland B.T., Sandell L.J.; 

RT "Molecular cloning of equine transforming growth factor beta 1 reveals 

RT a divergent nucleotide structure that encodes a novel bioactive 

RT peptide among mammalian species."/ 

RL Submitted (AUG- 1999) to the EMBL/ GenBank/DDB J databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF175709/ AAD49431.1; -. 

DR HSSP/ P01137; 1KLA. 

DR InterPro/ IPR002400/ GF_cysknot . 

DR InterPro/ IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR InterPro; IPR003 911; TGF_TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF0 068 8; TGFb__propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PRO 14 23; TGFBETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS0 025 0; TGF_BETA_1; 1. 

SQ SEQUENCE 390 AA; 43860 MW; 220FE4 0DFCCA6016 CRC64; 

Query Match 84.6%; Score 1835.5; DB 6; Length 390; 

Best Local Similarity 86.0%; Pred. No. 2e-155; 

Matches 350; Conservative 12; Mismatches 28; Indels 17; Gaps 1/ 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

i iimiimiiiimiiiiiiiiiiiiimiimmiiiiimiiiiiiiiiiimiii 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

Illllhllllllllllllllllll 'II I llllllllllllllllllll hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRAQVAGESAETEPEPEADYYAKEVTRVLMVEKENEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

I : Ih II II MM llh hi II II MM Mill MM III III 1 1 hill 



Db 121 YKTVETGSHS I YMFFNAS ELRAAVPDPMLLSRAELRLLRLKLS VEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHCSCDSKDNTLHVEINGFN 24 0 

1 1 Mill MIMIMIIIIMIIMM-.-. I-. II MINI llllllll I lllh 

Db 181 YLSNRLLTPSDSPEWLSFDVTGWRQWLSQGGAMEGLRLSAHCPCDSKDNTLRVGINGFS 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTN 300 

I MINIM 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 MINIM Mill 

Db 241 SSRRGDLATIDGMNRPFLLLMATPLERAQQLHSSRHRR ALDTN 283 

Qy 301 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 360 

II 1 1 ! 1 1 N 1 1 N IN 1 1 1 1 1 1 N IN I N 1 1 IN I NM 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 IM 

Db 2 84 YCSSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVLALY 34 3 

Qy 361 NQHNPGASAAPCCVPQALEPLPIVTYVGRKPKVEQLSNMI VRSCKCS 407 

IMIMMMMMM MMMMMMIMMMMMMMMI 

Db 344 NQHNPGASAAPCCVPQVLEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 390 

RESULT 2 
Q8R4D9 

ID Q8R4D9 PRELIMINARY; PRT ; 3 68 AA. 

AC Q8R4D9; 

DT 01-JUN-2002 (TrEMBLrel . 21, Created) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta-1 protein (Fragment) . 

GN TGFB1 . 

OS Sigmodon hispidus (Hispid cotton rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata ; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Sigmodont inae; 

OC Sigmodon. 

OX NCBI_TaxID=42415; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Blanco J.C., Pletneva L.M., Prince G . A . ; 

RT "Cotton rat cytokines, chemokines, and interferons."; 

RL Submitted (FEB-2002) to the EMBL/ GenBank / DDB J databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF480858; AAL87199.1; -. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

FT NON_TER 1 1 

SQ SEQUENCE 368 AA; 41905 MW; A5C91207B0468B4A CRC64 ; 

Query Match 81.0%; Score 1756.5; DB 11; Length 368; 

Best Local Similarity 84.9%; Pred. No. 2.1e-148; 

Matches 327; Conservative 16; Mismatches 25; Indels 17; Gaps 1; 
Qy 23 PGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYN 82 



, I I i ; 

Db 1 PGR PAAGLSTCKT I DMEL VKRKR I EAI RGQ I LS KLRLAS P PSQGE VP PGPLP EAVLALYN 6 0 

Qy 83 STRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSELRE 142 

III I MMMIMMMIMMMh I Mil I MM ||||::|| 

Db 61 STRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI YDKTKDI PHSVYMFFNTSDI RE 12 0 

Qy 143 AVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTG 2 02 

Mill MIMMM I I I I I I I I I I : I I I h ! I I I I I I I I : I : I I ! I I I I I I I I 
Db 121 AVPEPPLLSRAELRLQRFKSNVEQHVELYEKYSNNSWRYLGNRLLSPTDSPEWLSFDVTS 18 0 

Qy 203 WRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMA 262 

llhll = = IMM 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 I ! : Mill III ! 1 1 1 ! 1 1 1 1 1 

Db 181 WRKWLNQGDG I QGFRFSAHCS CDS KDN I LH VE I NG I S PKRRGDLGT I HDMNRPFLLLMA 240 

Qy 263 TPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTNYCFSSTEKNCCVRQLYIDFRKD 3 22 

IMMMMMMMI 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 II 1 1 1 1 

Db 241 TPLERAQHLHSSRHRR ALDTNYCFSSTEKNCCVRQLYI DFRKD 2 83 

Qy 323 LGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLP 3 82 

1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 ! I , 1 1 . 1 1 1 1 1 1 ! 1 1 i ! 1 1 , 1 1 : 1 1 1 1 

Db 284 LGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASASPCCVPQALEPLP 343 

Qy 3 83 I VYYVGRKPKVEQLSNMIVRSCKCS 4 07 

I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 344 I VY Y VGRKPKVEQLSNMI VRS CKCS 368 



RESULT 3 
Q8JHF5 

ID Q8JHF5 PRELIMINARY; PRT; 379 AA. 

AC Q8JHF5; 

DT 01-OCT-2002 (TrEMBLrel . 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 1. 

OS Sparus aurata (Gilthead sea bream) . 

OC Eukaryota; Metazoa; Chordata,- Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei ; 

OC Acanthomorpha ; Acanthopterygii ; Percomorpha; Perci formes; Percoidei; 

OC Sparidae; Sparus. 

OX NCB I JTaxI D= 8 175; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Tafalla C. , Aranguren R., Secombes C.J. , Castrillo J.L., Novoa B. , 

RA Figueras A. ; 

RT "Molecular characterization of sea bream (Sparus aurata) transforming 

RT growth factor betal." ; 

RL Fish and Shellfish Immunol. 0:0-0(2002). 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF510084; AAN03842.1; 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N . 

DR InterPro; IPR003911; TGF__TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFbjpropept ide ; 2. 



DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFBETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART ; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

SQ SEQUENCE 379 AA; 43506 MW; C0C9D3D2FCA2 9C0E CRC64 ; 



Query Match 40.0%; Score 868.5; DB 13; Length 379; 

Best Local Similarity 46.2%; Pred. No. 4.5e-69; 

Matches 189; Conservative 71; Mismatches 104; Indels 45; Gaps 16; 

Qy 12 LLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LS KLRL- AS PPSQGDVP P 70 

h h = = = I h :hllllhhhlhlllllll - I I ' I I I I II 

Db 3 LVFLMFMVAYTVGK-VSGMSTCKTLDLEMVKKKRIEAIRSQILSKLRLPTESPQAGD--E 59 

Qy 71 GPLPEAVLALYNSTRDRVAGESVEPE PEPEADYYAKEVTRVLMVESGNQI YDKFK 125 

:| ::|:|||||:: : = | = : | :|:| ||| = I : 
Db 60 EE I PSSLLSLYNSTKEMLKEQQTEVQTDIFTEQEEEEYFA TRVHKFNTTNPV 111 

Qy 126 GTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKV EQHVELYQKYSNDSWRYL 182 

II := I II Ihl :: : |h I I I I : I :| II Mill I III 

Db 112 RTPQNMSMSFNI SE I RRS I GDYRLLTTAELRML - I KAPTI LDEQRVELYQGLGT - SPRYL 169 

Qy 183 SNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSC--DSKDNTLHVEINGFN 240 

-I : HIIIIII == II : == hi = I I I • I hi 

Db 170 ASRFITNELRDKWLSFDVTETLQNWLKGNDDVQVFQLRLYCDCGRSSDVSTFSFGISGMT 22 9 

Qy 241 SGRRGDLATIHGMNR- - PFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALD 298 

:| Ml | : | : |::| |: | : || :|| :|: :| | 
Db 23 0 AG - RGDKAVLDDMTKQPP YI LTMS I PKNVSSHL - TSRKKRSTETK D 273 

Qy 299 TNYCFSSTEKNCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLA 358 

I I : II I I I I llllllllllllllhl lllhhl I llh : : I h = I I 

Db 274 T - - CTAQTE - TCCVRSLY I DFRKDLGWKW I HKPTR YHANYCMGS CTY I WNAENKYSQ I LA 330 

Qy 359 LYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 4 07 

II lllllll llllllllllllhllllh lllllllllhlllll 

Db 331 LYKHHNPGASAQPCCVPQALEPLP I LYYVGRQHKVEQLSNMI VKSCKCS 379 



RESULT 4 
Q8AXK8 

ID Q8AXK8 PRELIMINARY; PRT; 379 AA. 

AC Q8AXK8 ; 

DT 01-MAR-2003 (TrEMBLrel . 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 1. 

OS Sparus aurata (Gilthead sea bream) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Act inopterygii ; Neopterygii; Teleostei; Euteleostei; Neoteleostei ; 

OC Acanthomorpha ; Acanthopterygii ; Percomorpha; Perci formes; Percoidei; 

OC Sparidae; Sparus. 

OX NCBI_TaxID=8175; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Tafalla C. , Novoa B., Aranguren R., Figueras A.; 



RT "Molecular cloning and characterization of sea bream (Sparus aurata) 

RT TGF beta 1 . " ; 

RL Submitted (SEP-2001) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF424703; AAN76665.1; 

SQ SEQUENCE 379 AA; 43485 MW; A306EC387F6DBA7C CRC64 ; 

Query Match 39.9%; Score 864.5; DB 13; Length 379; 

Best Local Similarity 46.2%; Pred. No. le-68; 

Matches 189; Conservative 70; Mismatches 105; Indels 45; Gaps 16; 

Qy 12 LLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKR I EAI RGQI LSKLRL - AS P PSQGDVPP 70 

h |:::: I h : h I I I I I : h h I I = I I I I I I I llllllll I II 

Db 3 LVFLMFMVAYTVGK- VSGMSTCKTLDLEMVKKKRI EAI RSQI LSKLRLPTESPQAGD- - E 59 

Qy 71 GPLPEAVLALYNSTRDRVAGESVEPEPE PEADYYAKEVTRVLMVESGNQI YDKFK 125 

:| ::|:|||||:: : : | = : | :|:| ||| : | : 
Db 60 EEIPSSLLSLYNSTKEMLKEQQTEVQTDIFTEXEEEEYFA TRVHKFNTTNPV 111 

Qy 126 GTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKV EQHVELYQKYSNDSWRYL 182 

II - I II MM - = Ih MIIM I II Mill I Ml 

Db 112 RTPQNMSMSFNISEIRRSIGDYRLLTTAELRML- IKAPTILDEQRVELYQGLGT-SPRYL 169 

Qy 183 SNRLLAPSDS PEWLSFDVTG WRQWLTRREA I EGFRLSAHCSC - - DSKDNTLHVE I NGFN 240 

-I = I I I I I I I - || : =: hi Ml I M IM 

Db 170 ASRFITNELRDKWLSFDVTETLQNWLKGNDDVQVFQLRLYCDCGRSSDVSTFSFGISGMT 229 

Qy 241 SGRRGDLATI HGMNR - - PFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALD 2 98 

MUM: I : MM hi : I I Ml =|= M I 
Db 230 AG - RGDKAVLDDMTKQP PYI LTMS I PKNVSSHL - TSRKKRSTETK D 273 

Qy 299 TNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLA 358 

I I = M I I I I I I I I I I I I I I I I I I I : I I I Ih hi I llh : Mh M 

Db 274 T - - CTAQTE - TCCVRSLY I DFRKDLGWKW I HKPTR YHANYCMGS CT Y I WNAENKYSQXLA 330 

Qy 35 9 LYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 4 07 

II I I I I I I I I I I I I I I I I I I I I ^ I I I II ^ I [[ I I I I I I I I I I I I 

Db 331 LYKHHNPGASAQPCCVPQALEPLPILYYVGRQHKVEQLSNMIVKSCKCS 379 



RESULT 5 
Q91YU7 

ID Q91YU7 PRELIMINARY; PRT; 412 AA. 

AC Q91YU7; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor, beta 3. 

GN TGFB3 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria,- Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R. ; 

RL Submitted (OCT-2001) to the EMBL/GenBank/DDBJ databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF -BETA FAMILY. 



DR EMBL; BC014690; AAH14690.1; -. 

DR MGD; MGI: 98727; Tgfb3 . 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb__N . 

DR InterPro; IPR003911; TGF_TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423 ; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

SQ SEQUENCE 412 AA; 47144 MW; F3EB65D04 6DF32AD CRC64 ; 

Query Match 38.9%; Score 844.5; DB 11; Length 412; 

Best Local Similarity 44.2%; Pred . No. 7e-67; 

Matches 188; Conservative 57; Mismatches 127; Indels 53; Gaps 12; 

Qy 15 LLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLASPPSQGDVPPGPLP 74 

1= I M = I I I I hi MMIMIIIMIIIMM | | | : 

Db 9 L WLALLNLATI SLSLSTCTTLDFGH I KKKRVEA I RGQ I LS KLRLTS P PEPS VMT - - HVP 66 

Qy 7 5 EAVLALYNSTR DRVAGESVE- - PEPEPEADYYAKEVTRVLMVE SGNQI YDKFKG 12 6 

MINIMI : : II I : NMIMM = h= I- II 

Db 67 YQVLALYNSTRELLEEMHGEREEGCTQETSESEYYAKEIHKFDMIQGLAEHNELAVCPKG 126 

Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 

:: II | : : | ||| |:||: : || :||:| | 

Db 127 ITSKVFR-FNVSSVEK NGTNLFRAEFRVLRVPNPSSKRTEQRI ELFQILRPDEHIAK 182 

Qy 179 WRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSC DSK 228 

Ih : I : MINIM IMl MM I Mill 

Db 183 QRY I GG KNLPTRGTAEWLSFDVTDTVREWLLRRESNLGLE I S I HCPCHTFQPNGDI LENV 242 

Qy 22 9 DNTLHVE I NGFNS GRRGDLATI HGM NR P FLLLMATPLERAQHLHSSRHRRALDT 282 

= == I MM : = I IMl I I h 
Db 243 HEVMEIKFKGVDNEDDHGRGDLGRLKKQKDHHNPHLILMMIPPHRLDSPGQGSQRK 2 98 

Qy 283 NSYPYDVPDYASLALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGP 342 

MMMM : IMMII IMMIMIIMIMMIIIMIII II 

Db 2 99 KRALDTN YCFRNLEENCCVRPLY I DFRQDLGWKWVHEPKG Y YANFCSGP 34 7 

Qy 343 CPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVR 402 

MM I II M II III M 1 1 1 r 1 1 1 1 J MM IMMII llllllllhh 

Db 348 CPYLRSADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWK 4 07 

Qy 4 03 SCKCS 4 07 

Mill 

Db 4 08 SCKCS 412 

RESULT 6 
Q9PWA9 

ID Q9PWA9 PRELIMINARY; PRT; 382 AA. 

AC Q9PWA9; 

DT 01-MAY-2000 (TrEMBLrel . 13, Created) 



DT 01-MAY-2000 (TrEMBLrel . 13, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel . 23, Last annotation update) 

DE Transforming growth factor beta precursor. 

GN TGF-BETA. 

OS Morone chrysops x Morone saxatilis (white bass x striped bass) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Act inopterygii ; Neopterygii; Teleostei; Euteleostei; Neoteleostei ; 

OC Acanthomorpha; Acanthopterygii ; Percomorpha; Perciformes; Percoidei; 

OC Moronidae; Morone. 

OX NCBI_TaxID=45352; 

RN [ 1 ] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 

RC TISSUE=KIDNEY ; 

RX MEDLINE=20394636; PubMed=10938723 ; 

RA Harms C.A., Kennedy- Stoskopf S., Home W.A. , Fuller F.J., 

RA Tompkins W.A.F.; 

RT "Cloning and sequencing hybrid striped bass (Morone saxatilis x M. 

RT chrysops) transforming growth factor-beta (TGF-beta) , and development 

RT of a reverse transcription quantitative competitive polymerase chain 

RT reaction (RT-qcPCR) assay to measure TGF-beta mRNA of teleost fish."; 

RL Fish Shellfish Immunol. 10:61-85(2000). 

CC -!- FUNCTION: IS LIKELY TO BE AN IMPORTANT CYTOKINE REGULATING IMMUNE 
CC RESPONSE . MAY ALSO HAVE A ROLE IN OTHER PHYSIOLOGICAL SYSTEMS. 

CC -!- SUBUNIT: HOMODIMER, D I SULFI DE - LI NKED (BY SIMILARITY). 

CC -!- TISSUE SPECIFICITY: HIGHER LEVELS FOUND IN MONONUCLEAR CELLS FROM 
CC PERIPHERAL BLOOD THAN IN SPLEEN OR ANTERIOR KIDNEY. 



CC 


-!- SIMILARITY: 


BELONGS TO THE TGF-BETA FAMILY. 


DR 


EMBL; AF14 0363; 


AAD46997.1; - 




DR 


HSSP; P01137; 1KLA. 




DR 


InterPro; 


IPR002400; GF_cysknot . 


DR 


InterPro; 


IPR001 


83 9; TGFb . 




DR 


InterPro; 


IPR001111; TGFb N . 




DR 


InterPro; 


IPR003911; TGF TGFb. 


DR 


Pfam; PF00019; TGF-beta; 1. 




DR 


Pfam; PF00688; TGFb_propept ide ; 1. 


DR 


PRINTS; PR00438; 


GFCYSKNOT. 




DR 


PRINTS; PR01423; 


TGFBETA. 




DR 


ProDom; PD000357; TGFb; 1. 




DR 


SMART; SM0 02 04; 


TGFB; 1. 




DR 


PROSITE; 


PS00250; TGF BETA 1; 


1 . 


KW 


Growth factor; Mitogen; Glycoprotein; Signal. 


FT 


SIGNAL 


1 




POTENTIAL. 


FT 


PROPEP 


? 


270 




FT 


CHAIN 


271 


382 


TRANSFORMING GROWTH FACTOR BETA. 


FT 


DISULFID 


278 


286 


BY SIMILARITY. 


FT 


DISULFID 


285 


348 


BY SIMILARITY. 


FT 


DISULFID 


314 


379 


BY SIMILARITY. 


FT 


DISULFID 


318 


381 


BY SIMILARITY. 


FT 


DISULFID 


347 


347 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


73 


73 


N-LINKED (GLCNAC. . , ) (POTENTIAL) 


FT 


CARBOHYD 


108 


108 


N-LINKED (GLCNAC. . . ) (POTENTIAL) 


FT 


CARBOHYD 


113 


113 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


124 


124 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


259 


259 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


235 


237 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


382 AA; 43846 MW; 


3124D8C34EA74D72 CRC64 ; 



Query Match 38.9%; Score 843; DB 13; Length 382; 

Best Local Similarity 45.0%; Pred. No. 8.5e-67; 

Matches 185; Conservative 67; Mismatches 107; Indels 52; Gaps 14; 



Qy 15 LLWLLVLTPGR PAAGLSTCKT I DMELVKRKR I EA I RGQ I LS KLRL AS P PSQGDVP PG 71 

h = = l I I 'hllllhhhlhlllllll llllllll I II 

Db 6 LMLVWYTVGN - VSGMSTCKTLDLEMVKKKRI EAI RSQI LSKLRLPKEPEPDQAGDEEEI 64 

Qy 72 PLPEAVLALYNSTRDRVAGESVEPE PEPEADYYAKEVTRVLMVESGNQI YDKFKG 12 6 

II =1=11111 = : = = I = : | : | : | | : : | | 
Db 65 PTP- --LLSLYNSTKEMLKEQQTEVQTDISTEQEEEEYFAKVLHKFNMTRKNN 114 

Qy 12 7 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKV- - EQHVELYQKYSNDSWRYLSN 184 

I : I II Ihlhl : Ih 1111 = 1 I = M I I I I II Mh: 

Db 115 TDTTKKMFFNI SEI RESVGDYRLLTSAELRMLI KKTTI YDEQRVELYSGL-GDSPRYLAS 173 

Qy 185 RLLAPSDS PEWLS FDVTGWRQWLTRREAI EGFRLSAHCSCD S KDNTLHVE I NGFNS 241 

I = =1111111 == II = =11 = 1 11= = = 1 = 1 = 

Db 174 RF I TNKWKDKWLS FDVTKTLQDWLKGTDDEQGFQLRLFCECNKVSAGET I FKFG I SG I DP 233 

Qy 242 GRRGDLAT I HGMNR - - PFLLLMATPLERAQHLHS SRHRRALDTNSYPYDVPDYASLA 296 



Db 234 G-RGDTGPMQLLTQQPPYILTMSIP QNISSPSTSRKKRSTETK 275 

Qy 2 97 LDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKV 3 56 



Db 276 DVCTAQTE-TCCVRSLYIDFRKDLGWKWIHKPTGYNANYCMGSCTYIWNAENKYSQI 331 

Qy 357 LALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 4 07 

1 1 1 1 MMMI MMMMMMhMMh II I M M I M = M I M 

Db 332 LALYKHHNPGASAQPCCVPQALEPLP I LYYVGRQHKVEQLSNM I VKSCKCS 382 

RESULT 7 
Q91VP5 

ID Q91VP5 PRELIMINARY; PRT; 414 AA. 

AC Q91VP5; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to transforming growth factor, beta 2. 

GN TGFB2 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBIJTaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

KCJ TISSUE=Breast tumor; 

RA Strausberg R . ; 

RL Submitted (JUL-2001) to the EMBL/GenBank/DDB J databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY . 

DR EMBL; BC011170; AAH11170.1; 

DR MGD; MGI: 98 726; Tgf b2 . 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb N. 



DR InterPro; IPR003 911; TGF_TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide / 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR PRINTS; PR01423; TGFBETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

SQ SEQUENCE 414 AA; 47588 MW; DB37A7C3 8 8 8 1F2 8 6 CRC64 ; 

Query Match 38.8%; Score 841; DB 11; Length 414; 

Best Local Similarity 43.8%; Pred. No. 1.4e-66; 

Matches 190; Conservative 62; Mismatches 120; Indels 62; Gaps 15; 
Qy 12 LLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAIRGQILSKLRLASPPSQGDVP - P 70 

= 1 II II I 1 1 II h I h IIIIMMMIIIIhMM II I 

Db 5 VLSTFLLLHLVP - - VALSLSTCSTLDMDQFMRKR I EAI RGQI LSKLKLTS PPE - - DYPEP 60 

Qy 71 GPLPEAVLALYNSTRD RVAGESVEPEPE - PEADYYAKEVTRVLM VESGNQI YD 122 

:| = = : I I : HIIIII - I : I I I 

Db 61 DEVPPEVI S I YNSTRDLLQEKASRRAAACERERSDEEYYAKEVYKIDMPSHLPSENAI PP 120 

Qy 123 KFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKV- EQHVELYQ K 173 

I h : | : | : : I HI I II I =| II -Mil 

Db 121 TFY - RPYFRI VRFDVSTMEKNASN LVKAEFRVFRLQNPKARVAEQR I ELYQI LKSKD 176 

Qy 174 YSNDSWRYLSNRLLAPSDS PEWLSFDVTGWRQWLTRREAI EGFRLSAHCSC 225 

= : = 11= IMIMII | = =|| I I - I II I 

Db 177 LTSPTQRYIDSKWKTRAEGEWLSFDVTDAVQEWLHHKDRNLGFKISLHCPCCTFVPSNN 236 

Qy 226 DS KDNTLH VE 1 NGFNSGRRGDLAT I HGMNR PFLLLMATPLERAQHLHS 273 

: I I hi := MM: I I I I I I h I 

Db 237 YI I PNKSEELEARFAGIDGTSTYASGDQKTIKSTRKKTSGKTPHLLLMLLPSYRLESQQS 296 

Qy 274 SRHRRALDTNSYPYDVPDYASLALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKG 333 

M III III : : I I hi I I I I h : I II I I I I I I I I I 

Db 297 SRRKKR ALDAAYCFRNVQDNCCLRPLYIDFKRDLGWKWIHEPKG 340 

Qy 334 YHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKV 3 93 

1 = 111 I I II hi I III- llh II I II I I h I I I I I I I I I h I h I Ih 
Db 341 YNANFCAGACPYLWSSDTQHTKVLSLYNTINPEASASPCCVSQDLEPLTILYYIGNTPKI 4 00 

Qy 394 EQLSNMIVRSCKCS 4 07 

lllllllhlllll 
Db 4 01 EQLSNMIVKSCKCS 414 

RESULT 8 
093449 

ID 093449 PRELIMINARY; PRT; 382 AA . 

AC 093449; Q91217; 

DT 01-NOV-1998 (TrEMBLrel . 08, Created) 

DT 01-NOV-1998 (TrEMBLrel. 08, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta precursor. 

GN TGF-BETA OR TGF . 

OS Oncorhynchus mykiss (Rainbow trout) (Salmo gairdneri) . 



OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; 

OC Protacanthopterygii ; Salmonif ormes ; Salmonidae; Oncorhynchus. 

OX NCBI_TaxID=8022 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=LEUKOCYTE; 

RX MEDLINE=99242020; PubMed=10227481 ; 

RA Daniels G.D., Secombes C.J.; 

RT "Genomic organisation of rainbow trout, Oncorhynchus mykiss TGF- 

RT BETA . " ; 

RL Dev. Comp. Immunol. 23:139-147(1999). 

RN [2] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 

RC TISSUE=LEUKOCYTE; 

RX MEDLINE=98390168 ; PubMed=9722 928 ; 

RA Hardie L.J., Laing K.J., Daniels G.D., Grabowski P.S., Cunningham C. , 

RA Secombes C.J. ; 

RT "Isolation of the first piscine transforming growth factor beta gene: 

RT analysis reveals tissue specific expression and a potential regulatory 

RT sequence in rainbow trout (Oncorhynchus mykiss)."; 

RL Cytokine 10:555-563(1998). 

CC -!- FUNCTION: IS LIKELY TO BE AN IMPORTANT CYTOKINE REGULATING IMMUNE 

CC RESPONSE . MAY ALSO HAVE A ROLE IN OTHER PHYSIOLOGICAL SYSTEMS. 

CC -!- SUBUNIT: HOMODIMER, DISULFIDE -LINKED (BY SIMILARITY). 

CC TISSUE SPECIFICITY: EXPRESSED IN BLOOD LEUKOCYTES, KIDNEY 

CC MACROPHAGES, BRAIN, GILL AND SPLEEN BUT NOT IN LIVER. 

CC SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AJ007836; CAA07707.1; 

DR EMBL; X99303; CAA67685.1; -. 



DR 


HSSP; P01137; 1KLA. 




DR 


InterPro; 


IPR002400; GF_cysknot . 


DR 


InterPro; 


IPR001839; TGFb. 




DR 


InterPro ; 


IPR001111; TGFb N. 




DR 


InterPro; 


IPR0039 


11; TGF TGFb. 


DR 


Pfam; PF00019; TGF-beta; 1. 




DR 


Pfam; PF00688; TGFb_propept ide ; 1. 


DR 


PRINTS; PR00438; 


GFCYSKNOT. 




DR 


PRINTS; PR01423; 


TGFB ETA . 




DR 


ProDom; PD000357; 


TGFb; 1. 




DR 


SMART; SM0 02 04; TGFB; 1. 




DR 


PROSITE; 


PS00250; 


TGF BETA 1, 


1 . 


KW 


Growth factor; Mitogen; Glycoprotein; Signal. 


FT 


SIGNAL 


1 


20 


POTENTIAL . 


FT 


PROPEP 


21 


270 




FT 


CHAIN 


271 


382 


TRANSFORMING GROWTH FACTOR BETA. 


FT 


DISULFID 


278 


286 


BY SIMILARITY. 


FT 


DISULFID 


285 


348 


BY SIMILARITY. 


FT 


DISULFID 


314 


379 


BY SIMILARITY. 


FT 


DISULFID 


318 


381 


BY SIMILARITY. 


FT 


DISULFID 


347 


347 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


76 


76 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


116 


116 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


125 


125 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CONFLICT 


237 


237 


N -> D (IN REF. 2) . 


FT 


CONFLICT 


345 


345 


Q -> H (IN REF. 2) . 


FT 


CONFLICT 


371 


372 


LS -> VP (IN REF. 2) . 



FT CONFLICT 377 377 K - > M (IN REF . 2) . 

SQ SEQUENCE 382 AA; 44136 MW; 93BD4D354 0084B92 CRC64 ; 



Query Match 38.1%; Score 826; DB 13; Length 382; 

Best Local Similarity 46.8%; Pred . No. 2.8e-65; 

Matches 184; Conservative 57; Mismatches 104; Indels 48; Gaps 14; 

Qy 30 LSTCKTIDMELVKRKRIEAIRGQILSKLRLASPP SQGDVPPGPLPEAVLALYNSTRD 86 

: I IM - hi I I ! I I I I I I I I I I I Mi I I I | :|| :| :::::|||| : 

Db 23 MSTCKSLDLELVKRKRI EAIRGQILSKLRLPKEPEIDQEGDTE- -EVPASLMSI YNSTVE 80 

Qy 87 RVAGESVE PEPEPEADYYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSEL 14 0 

- I I : I II hllll = I =| I : I : Ml lh 

Db 81 -LSEEQVHTYI PSTQDAEEEA- YFAKEVHKFNMKQSENT SKHQI - -LFNMSEM 129 

Qy 141 REAVPEPVLLSRAELRLL RLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWL 196 

I : I MM I III I I II :Mh | IN : :: : |: 

Db 13 0 RSVLGTDRLLSQAELRLLI KNHGLLDDSEQRLELYRGV-GDKARYLKSHFVSKEWANRWV 18 8 

Qy 197 SFDVTGVVRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFNSGRRGDLATI - -HGMN 254 

Mill -• =11 MM I II MM I lh h 

Db 189 SFDVTQTLNEWLQGAGEEQGFQLKLPCDCGKPMEEFRFKISGMNK-LRGNTETLAMKMPS 247 

Qy 255 RPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTNYCFSSTEKNCCVRQ 314 

M Mlh IMI I III M I II -III!: 

Db 248 KPHILLMSLPVERHSQL-SSRKKRQTTTE EIC-SDKSESCCVRK 289 

Qy 315 LYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCV 374 

I I h 1 1 II . 1 1 1 1 1 1 1 1 II MMMM llh : MIMIMI IMMM MM 

Db 2 90 LYIDFRKDLGWKWIHEPTGYFANYCIGPCTYIWNTENKYSQVLALYKHHNPGASAQPCCV 34 9 

Qy 375 PQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 407 

II lllllhllllh II II 1 1 1 1 1 h I h M 

Db 350 PQVLE PLP 1 1 Y YVGRQHKVEQLSNM I VKSCRCS 382 



RESULT 9 
Q9PTQ2 

ID Q9PTQ2 PRELIMINARY; PRT; 376 AA . 

AC Q9PTQ2; 

DT 01-MAY-2000 (TrEMBLrel . 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta precursor. 

OS Cyprinus carpio (Common carp) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ,- 

OC Actinopterygii ; Neopterygii; Teleostei; Ostariophysi ; Cyprinif ormes ; 

OC Cyprinidae; Cyprinus. 

OX NCBI__TaxID=7962 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Yin Z. , Kuang J. ; 

RT "Molecular cloning of carp transforming growth factor beta 1 . " ; 

RL Submitted (MAR- 19 99) to the EMBL/ GenBank/DDB J databases. 

CC -!- FUNCTION: IS LIKELY TO BE AN IMPORTANT CYTOKINE REGULATING IMMUNE 

CC RESPONSE. MAY ALSO HAVE A ROLE IN OTHER PHYSIOLOGICAL SYSTEMS . 

CC -!- SUBUNIT: HOMODIMER, D I SULFIDE -LINKED (BY SIMILARITY). 



CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF136947; AAF22573.1; 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N . 

DR InterPro; I PR 003 911; TGFJTGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb__propept ide; 1. 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT SIGNAL 1 22 POTENTIAL. 

FT PROPEP 23 264 

FT CHAIN 265 376 TRANSFORMING GROWTH FACTOR BETA. 

FT DISULFID 272 280 BY SIMILARITY. 

FT DISULFID 308 373 BY SIMILARITY. 

FT DISULFID 312 375 BY SIMILARITY. 

FT DISULFID 341 341 INTERCHAIN (BY SIMILARITY) . 

FT CARBOHYD 76 76 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 125 125 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 167 167 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT SITE 23 0 232 CELL ATTACHMENT SITE (POTENTIAL) . 

SQ SEQUENCE 376 AA; 43329 MW; 7F7FC4DA58B6968 1 CRC64 ; 

Query Match 36.3%; Score 787; DB 13; Length 376; 

Best Local Similarity 43.8% ; Pred. No. 8.4e-62; 

Matches 180; Conservative 64; Mismatches 121; Indels 46; Gaps 15; 

Qy 6 LRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQ 65 

:h III I II : MM M M M I M I I I I I I I II I I I I I M I 

Db 1 MRVESLLLALQCLLGFV- -HYSGALSTCSPLDLELI KRKRI EAI RGQILSKLRLSKEPEV 58 

Qy 66 GDVP PG P - LP EAVLALYNST RDRVAGESVEPEPEP- EADYYAKEVTR- - VLMVESGN 118 

' I =:: = MII : I =h • I I =111111 = = -| 

Db 59 DEEKESQNI PAELI SVYNSTVELNEEQAAPPEQPKEDPVEEEYYAKEVHKFTI KLMEKNP 118 

Qy 119 QI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLK- - VEQHVELYQKYSN 176 

Ml = II : = -h Mill II Mill I 

Db 119 DKF LWFNITDISQTLGLNRI I SQVELRLLITTFPDGSEQRLELYQVIGN 167 

Qy 177 DSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHCS CDS KDNTLHVE I 236 

I III : I = = MMMM == II I II Mhl Ml : I : = l 

Db 168 KS-RYLESRFI - -PNQRKWLSFDVTQTLKDWLQRSEAEQGFQLKMADNCDPQ-KTFQLKI 223 

Qy 237 NGFNSGRRGDLATIH-GMNRPFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASL 295 

I III h I M MM: ||: : || ;| :|: | I 
Db 224 PGLVL-VRGDTETLAVNMPRPHILVMSLPLD-GNNSSKSRRKRQTETDQVCTDKSD 277 

Qy 296 ALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSK 355 

MM II MMMM Mill II I hi hi I I = = h = MM 

Db 278 GCCVRSLYI DFRKDLGWKWI HEPSGYYANYCTGSCSFVWTSENKYSQ 324 

Qy 356 VLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKC 4 06 

Mill I M II M II Ml I MM Mllh II 1 1 1 M 1 1 h M M 

Db 32 5 VTiALYKHHNPGASAQPCRVPQVLNPLPI FYYVGRQHKVEQLSNMI VKTCKC 375 



RESULT 10 
Q9ERB7 

ID Q9ERB7 PRELIMINARY; PRT; 399 AA . 

AC Q9ERB7; 

DT 01-MAR-2001 (TrEMBLrel . 16, Created) 

DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

DE Transforming growth factor-beta 2 (Fragment) . 

OS Mesocricetus auratus (Golden hamster) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Cricetinae; 

OC Mesocricetus. 

OX NCBI__TaxI D= 10036; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Ramesh G., Kondaiah P., Seshagiri P.B.; 

RT "Differential expression and selective localization of transforming 

RT growth factor-beta isoforms in the hamster uterus during estrous 

RT cycle."; 

RL Submitted (AUG-2000) to the EMBL/ GenBank/ DDB J databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY . 

DR EMBL; AY007214; AAG02247.1; -. 

DR HSSP; P08112; 2TGI . 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N . 

DR InterPro; IPR003 911; TGF_TGFb , 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS 00250; TGF_BETA_1 ; 1. 

FT NON_TER 1 1 

FT NONJTER 3 99 3 99 

SQ SEQUENCE 399 AA; 46078 MW; A6FF8E65EAFD514 8 CRC64 ; 

Query Match 36.0%; Score 780; DB 11; Length 399; 

Best Local Similarity 42.6%; Pred. No. 3.8e-61; 

Matches 179; Conservative 58; Mismatches 121; Indels 62; Gaps 15; 

Qy 18 LLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVP-PGPLPEA 76 

II I I I Mil hi h II HUM Mill Ihl III III 
Db 4 LLHLVP - - VALSLSTCSTLDMDQFMRKRI EAI RGQI LSKLKLTSPPE - -DYPEPDEVPPE 59 

Qy 77 VLALYNSTRD RVAGESVEPEPE- PEADYYAKEVTRVLM VESGNQI YDKFKGTP 128 

h::|IMII = = = I I : Mil III :: I I I I I I 

Ub 60 VISIYNSTRDLLQEKASRRAAACERERSDEEYYAKEVYKIDMPSHFPSENAIPPTFY-RP 118 

Qy 129 HSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKV- EQHVELYQ KYSNDSV? 179 

: : |: | : : I Ml h II I M II Mill - : 

Db 119 YFRI VRFDVSMMEKNASN LVKAEFRVFRLQNPKARVAEQRIELYQILKSKDLTSPTQ 175 

Qy 180 RYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSC DSK 228 

11=: = := 1 1 MUM I Ml - I MM II I M 

Db 176 RYIDSKWKTRAEGEWLSFDVTDAVHEWLHHKDRNLGFKISLHCPCCTFVPFNNNIIPNK 23 5 



Qy 22 9 DNTLHVE INGFNSGRRGDLATIHGMNR PFLLLMATPLERAQHLHSSRHRRA 279 

I hi : I II I I I I I I I : hi = = 

Db 236 SEELEARFAGI DGTSQHSSGHQETI KSTRKKNSGKTPHLLLMLLPSYRLESQQSNRRKKR 295 

Qy 28 0 LDTNSYPYDVPDYASLALDTNYCFSSTEKNCCVRQL.YIDFRKDLGWKWIHEPKGYHANFC 33 9 

III Ml : : 111=1 I II I I- I I , I I I I ' I I I I h I II I 
Db 2 96 ALDAAYCFRNVQDNCCLRPLYI DFKRDLGWKWI HEPKGYNANFC 33 9 

Qy 34 0 LGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNM 399 

I M I : II lll-Mhlll II 1 1 h 1 1 1 1 MM hlhl Ihllllll 

Db 34 0 AGACPYLWSSDTQHTKVLSLYNTINPEASASPCCVSHDLEPLTILYYIGNTPKI EQLSNM 3 99 



RESULT 11 
Q99K17 

ID Q99K17 PRELIMINARY; PRT; 362 AA. 

AC Q99K17; 

DT 01-JUN-2001 (TrEMBLrel . 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to transforming growth factor, beta 3 (Fragment) . 

GN TGFB3 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=100 90; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R. ; 

RL Submitted (MAR-2001) to the EMBL/GenBank/DDBJ databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; BC005513; AAH05513.1; -. 

DR HSSP; P10600; 1TGJ. 

DR MGD; MGI: 98727; Tgf b3 . 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR InterPro; IPR003911; TGF_TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGF BETA . 

DR ProDom; PD0003 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

FT NON__TER 1 1 

SQ SEQUENCE 362 AA; 41486 MW; 08 08E4 618 0FDAE70 CRC64 ; 

Query Match 34.0%; Score 736.5; DB 11; Length 362; 

Best Local Similarity 42.9%; Pred . No. 2.6e-57; 

Matches 164; Conservative 51; Mismatches 114; Indels 53; Gaps 12; 

Qy 58 RLASPPSQGDVPPGPLPEAVLALYNSTR DRVAGESVE - - PEPEPEADYYAKEVTRVL 112 

h Ml : I Ml Mill I = = II I : 1-1 I III: : 

Db 2 RVGSPPEPSVMT- -HVPYQVLALYNSTRELLEEMHGEREEGCTQETSESEYYAKEIHKFD 59 



Qy 113 MVE SGNQI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVE 165 

I- | = = II :: M | : : | ||| |:||: : | 

Db 60 M I QGLAEHNELAVCPKG I TSKVFR - FNVSS VEK NGTNLFRAEFRVLRVPNPSSKRTE 115 

Qy 166 QHVELYQKYSND SWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSA 221 

I =lhl I I h • I = llllllll 11 = 11 llh I :| 

Db 116 QRIELFQILRPDEHIAKQRYIGGKNLPTRGTAEWLSFDVTDTVREWLLRRESNLGLEISI 175 

Qy 222 HCSC DSKDNTLHVEINGFNS GRRGDLATIHGM NRPFLLLMATPL 265 

III :: : :: | :: |||| : : | |:|| | 

Db 176 HCPCHTFQPNGDI LENVHEVMEI KFKGVDNEDDHGRGDLGRLKKQKDHHNPHLI LMMI PP 235 

Qy 266 ERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTNYCFSSTEKNCCVRQLYIDFRKDLGW 32 5 

I h llllllll : hlllll lllllhllil 

Db 23 6 HRLDSPGQGSQRK KRALDTNYCFRNLEENCCVRPLYIDFRQDLGW 28 0 

Qy 326 KWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPGCVPQALEPLPIVY 385 

Ihlllllhllll llllh I II =1 M Ml II llhllllll I I M hi 

Db 281 KWVHEPKGYYANFCSGPCPYLRSADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILY 34 0 

Qy 386 YVGRKPKVEQLSNMIVRSCKCS 4 07 

I I I I I I I I I I I I h I = I I I I I 
Db 341 YVGRTPKVEQLSNMWKSCKCS 362 

RESULT 12 
Q98854 

ID Q98854 PRELIMINARY; PRT; 361 AA. 

AC Q98854; 

DT 01-FEB-1997 (TrEMBLrel . 02, Created) 

DT 01-FEB-1997 (TrEMBLrel. 02, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 2 (TGF-beta 2) (Fragment) . 

GN TGFB2 . 

OS Cyprinus carpio (Common carp) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi ; Cyprinif ormes ; 

OC Cyprinidae ; Cyprinus . 

OX NCBI_TaxID=7962; 

RN [1] 

RP SEQUENCE FROM N . A . 

RC TI SSUE=HEART ; 

RX MEDLINE=97354301; PubMed=9210595 ; 

RA Sumathy K. , Desai K.V., Kondaiah P.; 

RT "Isolation of transforming growth factor-beta2 cDNA from a fish, 

RT Cyprinus carpio by RT-PCR." ; 

RL Gene 191:103-107(1997). 

CC -!- FUNCTION: TGF-BETA 2 HAS SUPPRESSIVE EFFECTS ON I NTERLEUKI N - 2 
CC DEPENDENT T-CELL GROWTH. 

CC -!- SUBUNIT: HOMODIMER, D I SULF I DE - L I N KED (BY SIMILARITY) . 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; U66874; AAB62983.1; 

DR HSSP; P08112; 2TGI . 

DR Inter Pro; IPR00183 9; TGFb . 

DR InterPro; IPR001111; TGFb_N . 

DR InterPro; IPR003 911; TGF_TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 



DR Pfam; PF0068 8; TGFb__propept ide ; 1. 

DR PRINTS; PR0142 3; TGFBETA . 

DR ProDom; PD0003 57; TGFb; 1. 

DR SMART ; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA__1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein. 
FT 



FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



NON_TER 


1 


1 






PROPEP 


<1 


257 






CHAIN 


258 


361 




TRANSFORMING GROWTH FACTOR BETA 2. 


DISULFID 


264 


273 




BY SIMILARITY . 


DISULFID 


272 


335 




BY SIMILARITY. 


DISULFID 


334 


334 




INTERCHAIN (BY SIMILARITY) . 


CARBOHYD 


30 


30 




N-LINKED (GLCNAC . . .) (POTENTIAL) 


CARBOHYD 


98 


98 




N-LINKED ( GLCNAC . . .) (POTENTIAL) 


CARBOHYD 


199 


199 




N-LINKED (GLCNAC . . .) (POTENTIAL) 


NON TER 


361 


361 






) SEQUENCE 


361 AA; 


41931 MW; 


94D930FA970A3FD3 CRC64 ; 


Query Match 




32 . 


.0%; 


Score 6 93.5; DB 13; Length 3 61; 


Best Local Similarity 


40, 


■ 9%; 


Pred. No. 1.8e-53; 



Matches 157; Conservative 54; Mismatches 114; Indels 59; Gaps 14, 

Qy 52 QILSKLRLASPPSQGDVPPGP- -LPEAVLALYNSTRDRVAGESVEPEPEPE ADYY 104 

III 11 = 1= II == I I : ::|:|||||| = - I I Ml 

Db 1 QILCKLKLSCPP EI YPEPEEVSRDI IAI YNSTRDLLQEKANERAATCERQRTGEEYY 57 

Qy 105 AKEVTRVLM VESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL- 160 

I M I I Mil h : | : | : : | :||||: || 

Db 58 AKEVHKI DMQPFYPAENVI PTKHY-NPYFRRLRFDVSSMEKNASN LVKAELRIFRLQ 113 

Qy 161 - -KLKV-EQHVELYQ KYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRRE 212 

I =1 II Mill :: : ||: :::: |||||||| | :|| |: 

Db 114 N PKARVS EQR I EL YQ I LGHKDLTS PTQR Y I DS KWRTRTEGEWLS FDVTEAVS EWLLHRD 173 

Qy 213 AIEGFRLSAHCSC DS KDNTLHVE I NGFNSG - - RRGDLAT I HGMNR 255 

IhM III ' I I I - Ml I 

Db 174 RNNGFKI SLHCPCCTFVPSNNYI I PNKSEELEARFAGI DDSFVHGGDLKMFKKRRHSGQS 233 

Qy 2 56 PFLLLMATPLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTNYCFSSTEKNCCVRQL 315 

I I I I I I h I I Ih III :|| : : I I I = J I 

Db 234 PHLLLMLLPSYRLESQHKS - HRQ KRALDAAFCFRNVQDNCCLRSL 277 

Qy 316 YIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVP 375 

Nihil MM II I II I Ih I III I I I hi I I I hi :| Ml M MM' I II I 
Db 278 YIDFKKDLGWKWIHEPKGYNANFCAGACPYLWSADTQHSNILGLYNTINPEASASPCCVS 337 

Qy 376 QALEPLP I VYYVGRKPKVEQLSNM 3 99 

I MM I : I I : I : Ihllllll 
Db 338 QDLEPLTILYYIGKTPKIEQLSNM 361 



RESULT 13 
Q08714 

ID Q08714 PRELIMINARY; PRT; 130 AA. 

AC Q08714; 070331; 

DT 01-NOV-1996 (TrEMBLrel . 01, Created) 

DT 01-NOV-1996 (TrEMBLrel. 01, Last sequence update) 



DT 01-JUN-2002 (TrEMBLrel . 21, Last annotation update) 

DE Transforming growth factor beta 1 (TGF-beta 1) (Fragment) . 

GN TGFB1. 

OS Mesocricetus auratus (Golden hamster) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Cricetinae; 

OC Mesocricetus. 

OX NCBI_TaxID=10036; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=LVG (SYR) ; 

RX MEDLINE=93304479; PubMed=8 3 17544 ; 

RA Wong D.T., Donoff R.B., Yang J. , Song B.Z., Matossian K. , Nagura N . , 

RA Elovic A., McBride J . , Gallagher G. , Todd R. ; 

RT "Sequential expression of transforming growth factors alpha and beta 

RT by eosinophils during cutaneous wound healing in the hamster."; 

RL Am. J. Pathol. 143:130-142(1993). 

RN [2] 

RP SEQUENCE OF 26-115 FROM N.A. 

RC STRAIN=SYRIAN; TISSUE=SPLEEN ; 

RX MEDLINE=98234044 ; PubMed=9573 100 ; 

RA Melby P.C., Tryon V.V., Chandrasekar B., Freeman G.L.; 

RT "Cloning of Syrian hamster (Mesocricetus auratus) cytokine cDNAs and 

RT analysis of cytokine mRNA expression in experimental visceral 

RT leishmaniasis . " ; 

RL Infect. Immun. 66:2135-2142(1998). 

CC -!- FUNCTION: TGF-BETA 1 IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROLS 
CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF 

CC THEM HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 

CC REGULATES THE ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND 

CC DETERMINES A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC - ! - SUBUNIT: HOMODIMER, DI SULFIDE -LINKED . 

CC -!- SIMILARITY: TO OTHER GROWTH FACTORS OF THE TGF-BETA FAMILY. 

DR EMBL; X60296; CAA42838.1; -. 

DR EMBL; AF046214; AAC40099.1; -. 

DR HSSP; P01137; 1KLA . 

DR InterPro; IPR001839; TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein. 



FT 


NON__TER 


1 


1 




FT 


PROPEP 


<1 


18 




FT 


CHAIN 


19 


130 


TRANSFORMING GROWTH FACTOR 


FT 


DISULFID 


25 


34 


BY SIMILARITY. 


FT 


DISULFID 


33 


96 


BY SIMILARITY. 


FT 


DISULFID 


66 


129 


BY SIMILARITY. 


FT 


DISULFID 


95 


95 


INTERCHAIN (BY SIMILARITY) 


FT 


CONFLICT 


93 


93 


G -> S (IN REF. 2) . 


SQ 


SEQUENCE 


130 AA; 


14997 


MW; 8B41DD6CF3 9CCA77 CRC64 ; 



Query Match 31.9%; Score 691.5; DB 11; Length 130; 

Best Local Similarity 86.4%; Pred . No. 6.3e-54; 

Matches 127; Conservative 0; Mismatches 3; Indels 17; Gaps 



Qy 261 MATPLERAQHLHSSRHRRALDTNSYPYDVPDYASIALDTNYCFSSTEKNCCVRQLYIDFR 320 

Illllllllll llllll IIIMIIIIIIIIIIIIIIIIIIII 
Db 1 MATPLERAQHLQSSRHRR ALDTNYCFSSTEKNCCVRQLYI DFR 4 3 

Qy 321 KDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEP 380 

I II. il 1 1 II III 1 1 1 1 II I i! II M 1 1 M 1 1 1 III M I II 1 1 II llllllllll 

Db 44 KDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVXALYNQHNPGASAGPCCVPQALEP 103 

Qy 381 LPIVYYVGRKPKVEQLSNMI VRSCKCS 407 

II M 1 11 M 1 1 M II M 1 M I M Ml 

Db 104 LPIVYYVGRKPKVEQLSNMI VRSYKCS 130 

RESULT 14 
Q95N80 

ID Q95N8 0 PRELIMINARY; PRT; 124 AA . 

AC Q95N80; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel . 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 1 (Fragment) . 

OS Canis familiaris (Dog) . 

OC Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis. 

OX NCBI_TaxID=9615 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Fonfara S., Groene A., Baumgaertner W. ; 

RT "Sequence of canine transforming growth factor beta 1 mRNA in DH82- 

RT cells."; 

RL Submitted (FEB-2001) to the EMBL/ GenBank/DDBJ databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF349538; AAK54072.1; 

DR InterPro; IPR001839; TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR ProDom; PD0003 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS 00250; TGF_BETA_1 ; 1. 

FT NONJTER 1 1 

FT NON_TER 124 124 

SQ SEQUENCE 124 AA; 14329 MW; 2 1D18 52 18E5 556DB CRC64 ; 

Query Match 31.1%; Score 674.5; DB 6; Length 124; 

Best Local Similarity 87.2%; Pred. No. 1.9e-52; 

Matches 123; Conservative 0; Mismatches 1; Indels 17; Gaps 1; 
Qy 264 PLERAQHLHSSRHRRALDTNSYPYDVPDYASLALDTNYCFSSTEKNCCVRQLYIDFRKDL 323 

I M M M II I II II 1 1 II 1 1 1 II I M M 1 1 M II 1 1 ! I ! ! M 

Db 1 PLERAQHLHSSRQRR ALDTNYCFSSTEKNCCVRQLYI DFRKDL 43 

Qy 324 GWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI 383 

1 1 1 1 1 1 1 1 1 1 r 1 1 1 f 1 1 1 r r f 1 1 1 1 j f 1 1 1 1 1 j 1 1 1 1 ] 1 1 1 1 1 1 f 1 1 1 1 1 1 1 f 1 1 1 1 f 1 1 

Db 44 GWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI 103 

Qy 384 VY Y VGR K P KVEQLSNM I VRS C 404 

I I I I I II I I I I I I I I I I I I I I 
Db 104 VYYVGRKPKVEQLSNMIVRSC 124 



RESULT 15 
002730 

ID 002730 PRELIMINARY; PRT; 112 AA . 

AC 002730; 097501; 

DT 01- JUL- 1997 (TrEMBLrel . 04, Created) 

DT 01-JUL-1997 (TrEMBLrel. 04, Last sequence update) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last annotation update) 

DE Transforming growth factor beta 1 (TGF-beta 1) (Fragment) . 

GN TGFB1 OR TGF-BETA- 1 . 

OS Oryctolagus cuniculus (Rabbit) . 

OC Eukaryota; Metazoa; Chordata; Craniata ; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus. 

OX NCBI_Tax I D= 9 9 8 6 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Taylor T.K., James E.R., McGonigle S., Yoho E.R.; 

RL Submitted (APR-1997) to the EMBL/ GenBank/DDB J databases. 

RN [2] 

RP SEQUENCE OF 2-99 FROM N.A. 

RA Inoue K. , Kawabe Y. , Kodama T. ; 

RL Submitted (NOV-1998) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: TGF-BETA 1 IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROLS 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF 

CC THEM HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 

CC REGULATES THE ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND 

CC DETERMINES A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: HOMODIMER, DISULFIDE -LINKED (BY SIMILARITY). 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF000133; AAB53806.1; 

DR EMBL; AB020217; BAA36950.1; 

DR HSSP; P01137; UCLA. 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; IPR00183 9; TGFb. 

DR Pfam ; PF00019; TGF-beta; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1; 1. 

KW Growth factor; Mitogen; Glycoprotein. 



FT 


NON_TER 


1 


1 




FT 


CHAIN 


1 


112 


TRANSFORMING GROWTH FACTOR BETA 1 


FT 


DISULFID 


7 


16 


BY SIMILARITY. 


FT 


DISULFID 


15 


78 


BY SIMILARITY. 


FT 


DISULFID 


44 


109 


BY SIMILARITY. 


FT 


DISULFID 


48 


111 


BY SIMILARITY. 


FT 


DISULFID 


77 


77 


INTERCHAIN (BY SIMILARITY) . 


FT 


CONFLICT 


2 


3 


LD -> FS (IN REF. 2) . 


FT 


CONFLICT 


85 


92 


PLPIVYYV -> ATAHRVTTL (IN REF . 2) 


SQ 


SEQUENCE 


112 AA; 


12795 MW; 


53C5B7D46355A6F3 CRC64 ; 


Query Match 




29.4%; 


Score 63 8; DB 6 ; Length 112; 



Best Local Similarity 100.0%; Pred. No. 3.1e-49; 

Matches 112; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 296 ALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSK 355 

IIIIIIIMIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMM 

Db 1 ALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSK 60 

Qy 356 VLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 407 

' I I , I ' I I I II I I I I I I I I I M I I I I I I I I I M 1 I. I I I I I I I I I I I I I I I 
Db 61 VLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 112 



Search completed: October 28, 2003, 09:12:27 
Job time : 33.6306 sees 



